THE STRUCTURE OF 

HUMAN ABILITIES 


PHILIP E. VERNON 

]VLA., Ph.D. 


Professor of Educational Psychology 
in the Institute of Education, 
University of London 


LONDON: METHUEN & CO. LTD. 
NEW YORK: JOHN WILEY & SONS, INC. 



First published in 1930 


CATALOO0B NO, S291/U (METHUEN) 


PRINTED IN GREAT BRITAIN BY THE ALOHN PRES'j 
WELWYN GARDEN CITY, HERTS. 



Dedicated without permission, but with profound respect 


to 

a BURT and g. h. Thomson 

(with whom I almost always agree) 

and to 

L. L. THURSTONS and the late c. e. spearman 
(with whom I usually disagree) 




PREFACE 


S OME acquaintance with the principles and results of factor 
analysis is nowadays necessary to the study and practise of 
educational, vocational, or other branches of applied psycho- 
logy. The mathematical techniques of factorization are less 
essential for those who are not actually going to carry out analyses, 
and|they are omitted in this book. Even without them the topic 
is a difficult one, but I have tried to present it simply and to show 
that it does have extremely important practical bearings I assume 
only that the reader has had an elementary course in psychology 
and knows what an intelligence test and a correlation coefficient 
are. One section of the book which is inevitably more technical 
than the rest is relegated to an appendix. 

My other chief aim is to bring together the large number of 
publications in this field, in Britain and America, which at first 
sight appear to give contradictory and confusing accounts of 
mental structure, and to show that they can be fitted into one 
consistent — even if incomplete — ^picture This involves the re- 
working or reinterpretation of the results of many authors. Almost 
all the contributions from about 1935 to the middle of 1949 are 
critically surveyed (though less attention is paid to earlier litera- 
ture), and several unpublished analyses carried out by British 
psychologists in the Services are described. The book does not, 
however, attempt to cover studies of personality factors, attitudes 
and interests, or other fields outside that of abilities. 

Too many colleagues to mention by name have encouraged me 
to write this book, or have made useful comments on earlier drafts. 
I am indebted to them, also to my wife for her assistance in pre- 
paring the manuscript and index. Acknowledgment is due to the 
National Institute of Industrial Psychology for permission to re- 
print several sections of an article (Vernon, 1949a). It may be 
noted here that dates are inserted after authors’ names as refer- 
ences to the bibliography. 


P. E. V. 
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CHAPTER I 


MENTAL FACULTIES AND FACTORS 

Abstract. This chapter criticizes the ascription of human 
abilities and behaviour to hypothetical faculties, traits or powers 
of the mind, as exemplified in phrenology and in various educa- 
tional and other theories An ability or factor should be thought 
of as a class or group of performances, and it should be admitted 
only if a number of measurements in this class (e.g, test results) 
overlap or correlate positively with one another. The basic mathe- 
matical methods of factor analysis are quite straightforward. By 
means of them we can find what tests, examinations, etc., really 
measure, i.e. their factor content. The terms g, group factor, 
communality, specificity and variance are explained. Some of the 
limitations of factor analysis and its relations to other psycho- 
logical methods are pointed out. 

Faculties and Phrenology. The faculties or powers of the 
human mind have been for centuries a matter of interest, not only 
to the ordinary man who wishes to explain his own conduct and 
that of other people, but also to the philosopher, psychologist and 
educationist. Until recent years, however, their nature and 
numbers were matters of pure speculation. Casual observation 
and introspection are incapable of providing scientific proof of 
their existence, and in consequence many past theories of human 
abilities and quahties and their organization were entirely fal- 
lacious. One of the most popular doctrines of the early nmeteenth 
century was the phrenology of Gall, Spurzheim and Coombe, 
which assumed that the strength of each faculty was indicated by 
the prominence of bumps on the appropriate parts of the skull. 
Thirty-seven faculties were recognized including Propensities 
(Amativeness, Combativeness, etc.), Sentiments (Self-esteem, 
Conscientiousness), Perceptive (Size, Colour, Number, Tune, 
Language), and Reflective (Comparison and Causality). We now 
know that traits and abilities are not located in particular parts of 
the brain, and that the growth of areas of the brain does not 
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produce protuberances on the skull. Indeed it has been said that, 
‘The bumps on a man’s skull tell you more about his wife’s 
character than his own.’ But the criticism which mainly concerns 
us is that Gall provided no evidence that these faculties are the 
fundamental ones, nor that they are independent or self-con- 
sistent. Is the sense of size really distinct from that of form; may 
not both depend on some more basic capacity? Is the person good 
at reading and spelling the English language necessarily good at 
languages in general? How about memory, reasoning, mechanical 
sense, and a host of other qualities absent from the list? 

Faculties in Educational Theory. Unscientific faculty 
psychology permeates educational theory and practice even at the 
present day. School subjects or new methods of teaching are 
introduced because they are alleged to develop such and such a 
faculty. For example. Nature Study stimulates powers of observa- 
tion, learning poetry develops the memory, and so on. A Board of 
Education circular in the 1930s justified physical training in schools 
not merely for its effects on the health of pupils, but because it 
enhances mental alertness, self-control and the team spirit. At 
one time it was thought that the reasoning faculty is only rudi- 
mentary in children up till the age of twelve, whereas memory is 
comparatively strong, hence primary schooling should concen- 
trate on drill subjects. Scientific research has largely contradicted 
these assumptions. Children of three years and younger often 
reason out the solutions to problems that interest them; the 
capacity for learning new material increases with age and is much 
superior in the young adult to that of the seven-year child, as 
Thorndike has shown. This capacity, moreover, cannot be im- 
proved all-round simply by practising the memorization of poetry, 
multiplication tables or spellings. And an experiment by Sutcliffe 
and Canham (1937) disproved the view that extra physical training 
in school hours is of sufficient benefit to the mind to compensate 
for the loss of time from intellectual work. 

Faculties in Popular Thought and in Vocational Psycho- 
logy. Equally dubious is the view so often put forward by parents 
that their boy who is backward at school work makes up for it by 
being good with his hands, or the assumption that the quick 
worker is usually inaccurate. We tend to divide people up into 
types — the practical, the academic, the leader, the aesthete, etc., 
forgetting that most individuals are good at some practical things. 
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not at others. The boy who would make a good carpenter would 
.not necessarily succeed as a plumber, nor a machinist as a civil 
engineer. The influence of this type theory manifests itself when 
employers distrust the psychologist’s paper-and-pencil spatial and 
mechanical tests for selecting tradesmen. Such tests, they think, 
are too ‘theoretical’ to pick the man with the practical ‘flair’ or to 
eliminate the ‘ham-handed’. The whole field of occupational 
selection and guidance has indeed been befogged by unverified 
speculations about qualities underlying jobs. It is not only the 
layman who analyses a job as requirmg ‘dexterity, alertness and 
concentration’ Psychologists have carried out more systematic 
studies of jobs, but many, particularly in Germany, have been 
equally guilty of assuming that names like these represent distinc- 
tive and consistent abilities (cf. Vernon and Parry, 1949). 

Definitions of Intelligence. It is psychologists again who, 
although they have been testing intelligence with some success for 
over forty years, have failed to reach any agreed definition as to 
what it is they are measuring. Binet frankly regarded it as a col- 
lection of faculties, ‘judgment, practical sense, initiative 

adapting oneself to circumstances’. His scale, however, was com- 
posed of tests which would differentiate older from younger 
children. The only criterion that they were measuring judgment, 
etc., was his own opinion. Several psychologists have considered 
intelligence as the ability to profit from experience, which con- 
trasts with the mechanical, instinctive reactions of lower species. 
None of the commonly used tests appear to involve any such 
quality. In a famous symposium published in 1921, thirteen 
psychologists gave thirteen different views. Terman stressed 
capacity for abstract thinking, Dearborn capacity to learn, Colvin 
adjustment to environment, and so on. Actually there is much 
overlapping between such views, but further theoretical discussion 
win not get us anjrwhere. It will not teU us just how much they 
have in common, nor what is the real essence of intelhgence.^ 

The Empirical Approach to Human Abilities, Based on 
Correlation. Psychologists nowadays tend to adopt a more 
operational or Behaviouristic outlook, though rejecting the wilder 
excesses of J. B. Watson’s doctrines. They realize the fruitlessness 
of mental entities such as faculties, which can never be directly 

^ A useful summary of these and other definitions, and an explanation of 
Spearman's two^factor theory, are given by Knight (1933). 
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observed nor verified, and prefer to deal with concepts directly 
derived from measurable activities of human beings. An ability is 
inferred from the fact that some people carry out certain tasks 
more rapidly or more correctly than others. Whether it also de- 
pends on some power in the mind is a matter which interests meta- 
physicians but not scientists. The clue leading to a scientific 
solution of the impasse is provided by the term overlapping, i.e. by 
correlation. By means of correlation we can find whether the scores 
of a group of people on two or more tasks correspond or not, and 
therefore whether these tasks involve the same, or distinctive, 
abilities. If several tests presumed to measure a particular abihty 
do not correlate positively with one another, that ability cannot be 
accepted as a useful conception. Take memory as an example. 
We all know that a schoolboy may have an excellent memory for 
cricket scores or names of motor-cars, and a poor memory for 
school work, and that a professor who remembers everything about 
his own subject may be absent-minded in daily life, or forgetful 
of names and faces. If these various kinds of memory are measured 
and inter-correlated and little or no agreement is found, it is ob- 
vious that there is no one general faculty of memory, but a lot of 
specific varieties. We need not demand that such tests correlate 
perfectly; they may show a limited amoufit of overlapping, and 
some may correlate more highly with the rest than others do. But 
only in so far as they do correlate can they be regarded as measur- 
ing a memory ability or factor. Otherwise each test is merely 
measuring the ability specific to that test and to no other. It follows 
too that any test can be regarded as divisible into two portions 
which we call its comrmnahty and its specificity, i.e. what it has in 
common with other tests and what is specific to it alone. 

There is yet another possibility. Positive correlations between 
several tests designed to measure memory might arise if the tests 
were in fact all measuring some other, more fundamental, ability, 
say intelligence. Factorial technique enables us to examine this, 
and to discover whether or not there is overlapping over and 
above anything attributable to intelligence. We thus arrive at the 
definition of an ability given by the writer elsewhere (Vernon, 
1940): ‘It implies the existence of a group or category of per- 
formances wluch correlate highly with one another, and which 
are relatively distinct from (i.e, give low correlations with) other 
performances.’ 
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Illustration of the Mathematical Fundamentals of Factor 
Analysis. It is unfortunate that this approach to the analysis of 
abilities involves somewhat complicated mathematics, since this 
frightens or antagonizes many of the teachers, employers, and 
others who are most prone to discuss abilities unscientifically. Yet 
the basic principles are very simple, as the following hypothetical 
examples will show. 


TABLE I —CORRELATION COEFFICIENTS BETWEEN SIX 
PSYCHOLOGICAL TESTS 


Tests 

1 

2 

3 

4 

5 

6 

1. 

Vocabulary 


+ •76 

+ •79 

+ •45 

+ •41 

+ *34 

2 

Analogies 

+ •76 


+ * 68 

+ *44 

+ •35 

+ •26 

3 

Classifications 

+ •79 

+ •68 


+ •49 

+ •39 

+ •32 

4. 

Block Design 

+ •45 

+ •44 

+ •49 


+ • 58 

+ *44 

5. 

Spatial 

+ •41 

+ •35 

+ •39 

+ • 58 


+ •55 

6 

Formboard 

+ •34 

+ *26 

+ •32 

+ *44 

+ *S5 



Table I gives the correlations that might be obtained between 
six tests applied to a large group of children (Block Design and 
Formboard being given individually). Inspection suggests that 
the correlations between the first three and last three are relatively 
small, i.e. that ability at verbal tests is partially distinct from ability 
at practical or spatial tests. But the separation is incomplete. All 
the correlations are positive, showing that all tests have something 
in common, presumably of the nature of general intelhgence By 
the appropriate techniques we can find how far each test measures 
this general ability or factor which we shall call g, and Table II 
lists the loadings, saturations or correlations with^. Now if this 
was the only underlying ability, we could reproduce the test inter- 
correlations simply by taking the products of their ^-loadings. 
For example; 

^35”^3g^^5g =*8x*5=*40 

Such products are listed in Table II, and in Table III each pro- 
duct has been subtracted from the corresponding original corre- 
lation to show what overlapping, if any, remains. These are known 
as residual correlations, 

B 
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TABLE II.— G-LOADINGS OF THE SIX TESTS AND THEIR 

PRODUCTS 



G-Loadmgs 

1 

2 

Products 
3 4 

5 

6 

1. Vocabulary 

*8 


•56 

•64 

•48 

•40 

•32 

2. Analogies 

*7 

•56 


•56 

•42 

•35 

•28 

3. Classifications 

•8 

•64 

•56 


•48 

*40 

•32 

4. Block Design 

•6 

•48 

*42 

•48 


•30 

•24 

5. Spatial 

•5 

•40 

•35 

•40 

*30 


•20 

6. Formboard 

•4 

•32 

•28 

•32 

•24 

•20 



TABLE III.— RESIDUAL CORRELATIONS AFTER SUBTRACTING 
THE OVERLAPPING ATTRIBUTABLE TO G 



1 2 3 

4 5 6 

1. Vocabulary 

2. Analogies 

3. Classifications 

+ -20 +-15 

+ •20 +-12 

+ •15 +-12 

+ •03 +-01 +-02 

+ •02 +-00 — 02 

+ •01 — 01 +*00 

4. Block Design 

5. Spatial 

6. Formboard 

— 03 +-02 +-01 

+ •01 +*00 — 01 

+ •02 — 02 +*00 

+ •28 +*21 

+ •28 +*35 

+ •20 +-35 


The residuals between the first three and last three tests are not 
all zero, but are so close to it that they can reasonably be attributed 
to chance errors in the original correlations. Within each group of 
three however the residuals are large, showing that distinct verbal 
and practical-spatial abilities are present. Each set can be analysed 
separately, and if the following loadings are multiplied out, they 
exactly reproduce the residual correlations: 


Verbal-factor 

loading 

1. Vocabulary -5 

2. Analogies -4 

3. Classifications • 3 


Spatial factor 
loading 

4. Block Design -4 

5. Spatial -7 

6. Pormboard • 5 
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Subsidiary abilities, over and above are called group factors, 
since they run through a hmited group of tests. It is preferable to 
name them by symbols, such as v for verbal, k for spatial, rather 
than giving them ability-names which may readily be misinter- 
preted Similarly we use g to refer to the objectively established 
general factor, instead of the subjective and indefinable term 
intelligence 

The communality of any test, i.e. its total factor-content, is 
shown by the squares of its factor loadmgs. Table IV lists these 
loadings, their squares, the communality (h^), and what is left over 
from 1.0, i.e. the specificities,^ Thus we can state that the Voca- 
bulary test measures 64 per cent, g, 25 per cent, v, and the remain- 
ing 1 1 per cent, is specific. The Formboard is a much poorer g 


TABLE IV.—COMPLETED FACTOR ANALYSIS OF SIX PSYCHO- 
LOGICAL TESTS 



Loadings 

g V k 

Squares of 
Loadmgs 
g V k 

Commim- 

ahty 

h^^ 

Speci- 

ficity 

1— h* 

1. Vocabulary 

•8 

•5 


•64 

•25 

•89 

•11 

2. Analogies 

•7 

•4 


•49 

•16 

•65 

35 

3. Classifications 

*8 

•3 


•64 

*09 

•73 

•27 

4. Block Design 

•6 


•4 

-36 

•16 

•52 

•48 

5. Spatial 

•5 


•7 

•25 

•49 

•74 

•26 

6. Formboard 

•4 


•5 

•16 

•25 

•41 

*59 


Vanance 

42-3 

8-3 15-0 

65-7 

34-3 


test, only 16 per cent, of what it measures being attributable to the 
general factor, 25 per cent, to k, and 59 per cent, specific. Such 
figures are known as the variances of the factors, and the average 
variance of each factor is given in the bottom row. These figures 
represent the importance or size of the factors in this battery of six 
tests. 

^ Many factonsts further subdivide this term mto the unrehability or error- 
variance of the test, and its true specificity. For example if the rehabihty co- 
efiSdent of the Vocabulary test is shown to be *94, then the error variance is 06 
and the vanance of the factor specific to the test alone *05 (cf. textbooks of psycho- 
logical statistics such as Guilford, 1936). 
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Factors Are Not Mental Elements Like Faculties. From 
this example it should be clear that a factor is a construct which 
accounts for the objectively determined correlations between tests, 
in contrast to a faculty which is a hypothetical mental power. We 
can if we wish go on to theorize about the psychological nature and 
origin of factors. Better, we can conduct experiments to discover 
just what performances involve a factor, among which groups of 
people it emerges, and what conditions affect it. But factors 
should be regarded primarily as categories for classifying mental 
or behavioural performances, rather than as entities m the mind 
or nervous system. Since by means of factor analysis we can 
reduce a large battery of tests to a few underlying factors there is a 
certain parallel to the analysis of chemical compounds into their 
constituent elements. But this analogy should not be pressed too 
far, for we shall see later that factors are much too fluid, too de- 
pendent on the particular groups and particular tests studied, to 
be compared with elements. For example we might expect, and 
will indeed find, that the factors in scholastic abilities are depend- 
ent on how school subjects are taught. Some teachers emphasize 
the connections between the various branches of mathematics, or 
between a country's language and its literature and history, much 
more than others do, and this is likely to be reflected in the cor- 
relations and factors. 

Identification of Factors. How factors should be identified 
and named is a somewhat controversial point. According to Guil- 
ford (1940) the factorist studies the common material, formal and 
functional features in the tests which are loaded with a factor and 
from this deduces its nature. Most factors are defined by material 
(e.g. verbal, mechanical information, etc.). The form of the test — 
whether apparatus or paper-and-pencil, choice-response or crea- 
tive-response — ^has not yet been proven to have much influence. 
Functional factors involve consideration of the testees^ mental 
processes, by means of introspections or job analysis procedures or 
both, (e.g. reasoning, attention, etc.). Bentley (1948) and others 
have criticized the looseness of factorists’ terminology, and the 
subjectivity of their guesses about the nature of some of their 
factors. We agree with him that it is better to avoid names of 
hypothetical functions or faculties, but would claim that the old- 
fashioned procedure (still common among some vocational 
psychologists, psychiatrists, teachers and others) of assuming 
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that a faculty exists and that certain tests measure it, is very much 
more subjective. Factorists do not, in fact, rely on hunches but 
always try to provide objective confirmation of a factor by carrying 
out further analyses with other populations and with enlarged 
batteries of tests, with a view to defining its content and extent 
more accurately. 

Some Limitations of Factor Analysis* The mistake should 
not be made of identifying the whole of the psychology of abilities 
with factor analysis. Vocational and educational selection and 
guidance must take account not only of personality traits and 
interests which might profitably be expressed as factors also, but 
also of relevant experience, home circumstances and the like. And 
although there is a strong case for substituting objective tests for 
the subjective judgments of an interviewer, in practice it is seldom 
possible to carry out such guidance without an interviewer to bring 
together all the data and to interpret them to the candidates (cf. 
Vernon and Parry, 1949) Still more important for the develop- 
ment of psychological science are experiments on conditions 
affecting the performance of skills and of mental tasks, for example, 
investigations of the design of equipment, or studies of the learning 
process, of concept formation, of physical or mental fatigue and 
boredom, and so forth. Here factor analysis is largely irrelevant, 
since it deals only with the end products of human thinking and 
behaviour, and throws little light on how these products come 
about in individual human beings. Factors are indeed a kind of 
blurred average, for though they derive from the common features 
displayed by a large group of people, they may stem from very 
diverse mental and physical processes in different people. Analysis 
does not even usually tell us which factors an individual uses in 
any given performance, though it probably could do so. Thus one 
individual may score well at a test through high g, another might 
get the same score by virtue of some group factor, yet another 
through specific ability at that particular test. 

The real need for factors arises as soon as we begin to discuss 
and name abilities or traits, and to compare the relative standing of 
different people on such faculties Factor analysis is complement- 
ary, not opposed, to the approach of the experimental psycho- 
logist; but both are opposed to the layman's unscientific specula- 
tions about human qualities and their underlying nature. 

It should be realized also that the ‘map' of the mind so far 
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provided by factor analysis is very incomplete, although it repre- 
sents a remarkable advance over what was known at the beginning 
of the century. Factorial investigations normally require the 
application of at least a dozen tests (Americans prefer forty to fifty) 
to several hundred subjects, and the labour of calculating the 
correlations and extracting the factors is almost too great to be 
done without mechanical aids. Moreover, the results are so 
affected by the particular tests used, especially when the battery is 
small, and by the background, sex, age, and other characteristics of 
the populations tested, that it is only by co-ordinating the findings 
of numerous analyses that reasonable certainty begins to emerge. 
Finally we shall see that different analysts often interpret the same 
results differently, though the confusion to which this leads is more 
apparent than real. 



CHAPTER II 


LANDMARKS IN THE DEVELOPMENT OF FACTOR 

ANALYSIS 

Abstract. Some of the investigations which contributed most to 
the development of factor analysis from 1904 to 1947 are surveyed. 
Until the 1930s the predominant view among American psycho- 
logists was that all abilities are highly specific. In Britain, the im- 
portance of the general factor, g, was demonstrated by Spearman, 
but the existence of additional sub-types of ability or group factors 
gradually emerged from the work of Burt, Kelley, Stephenson, El 
Koussy, Alexander, and others. Results obtained from analyses 
among recruits during the 1939-45 war confirmed the hierarchical 
theory, to which this book is committed. This holds that there are 
certain main types of abihty over and above g (in particular the 
educational and the practical types), and that these themselves can 
be subdivided into numerous minor group factors. Thurstone, 
Guilford, and other factorists in Amenca from 1938 on, opposed 
the notion of a general factor and hierarchy. Instead they showed 
that test inter-correlations could be accounted for by a number of 
independent types of ability or multiple factors, not unlike the 
nineteenth-century faculties However, more recent work suggests a 
rapprochement between the hierarchical and the Thurstonian 
viewpoints. 

The Viewpoint of Early American Psychologists. In the 
late nineteenth century the method of correlation was devised, 
largely by Galton and Pearson, for measuring the agreement 
between two sets of scores. Some of the first apphcations of this 
method to mental functions were made by Wissler (1901) and 
Thorndike m America, with disconcerting results. Tests of re- 
action time and sensory acuity, for example, showed scarcely any 
correlation with the grades of college students. Apparently the 
‘alertness’, ‘concentration’, ‘sensitivity’ or other qualities entering 
mto these tests were not the same as the ‘alertness’, etc., involved 
in university work. Indeed for many years the notion of measuring 
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mental ability in general was discredited in America. The experi- 
ments of William James, Thorndike and others on transfer of 
training reinforced the view that abilities are highly specific. If 
practice in learning English poetry does not improve the ability to 
learn French poetry, there can be no such thing as general memory. 
As Stratton put it ‘What you do to the mind by means of educa- 
tion knows its place; it never spreads You train what you train.’ 
Probably no American psychologist of the present day adheres to 
this extreme specific view, which Spearman called the anarchic 
theory of mental structure. Nevertheless it exerted a profound 
influence right up to the 1930s. Muscio (1922) in Britain, and 
Perrin (1921) in America obtained extremely small correlations 
between different tests of manual skills, and, in a well-known study 
of mechanical ability at Minnesota University, Paterson and 
Elliot (1930) showed that mechanical capacities are far from 
unitary. In the field of personality and character also diflFerent 
tests of the same trait often failed to correlate, one of the most 
striking investigations being that of Hartshorne and May (1928). 
These authors concluded that we should try to develop honest 
habits among children in each specific life situation, rather than to 
tram honesty in general. 

Spearman’s Two-Factor Theory. During the period 1900-30 
British psychologists, headed by Spearman, Thomson and Burt, 
followed a different course. It was in 1904 that Spearman pub- 
lished his correlations between sensory tests and estimates of 
intelligence which showed that: ‘all branches of intellectual 
activity have in common one fundamental function (or group of 
functions), whereas the remaining or specific elements of the 
activity seem in every case to be wholly different from that in all 
others.’ When correlations can be wholly accounted for by^, they 
tend to fall into what he called a hierarchical pattern.^ Later he 
developed the tetrad difference technique of proving that no signi- 
ficant factors other than g and specifics are present. 

The Abilities of Man (1927) contains the fullest account of 
Spearman’s theories and of the numerous supporting investiga- 


^ This term refers to a steadily descending order of the inter-correlations and 
should not be confused with the hierarchical structure of group factors, des- 
cribed below A useful and non-technical exposition of Spearman^s views is 
given by F* C. Thomas (1935) m Ahhty and Knowledge. The whole develop- 
ment of factor analysis, particularly in America, is reviewed by Wolde (1940) in 
Factor Analysis to 1940. 
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tions by himself and his students. In it he shows that neither the 
anarchic, nor what he calls the monarchic or oligarchic theories of 
the mind accord with the facts. The monarchic view reduces all 
abilities to a single capacity of general intelligence or ‘common 
sense’ This would imply that they are all perfectly correlated, and 
would make no allowance for the unevenness of people’s abilities 
along different lines. The oligarchic theory is the view that the 
mind is ruled by a number of separate powers or faculties. Spear- 
man's two-factor theory satisfactorily explained the tendency for 
all abilities to overlap to some extent, and yet to show considerable 
unevenness. The pupil who is good at English is usually superior 
also in anthmetic, and even at handwriting or in physical health. 
At the same time each subject involves its own specific, or s, factor; 
hence some pupils may be relatively better at English than at 
number work, or vice versa. The specific factors in practical and 
physical abilities are larger, their ^-saturations smaller, hence dis- 
crepancies in these abilities are much more common. 

The two-factor theory provides a logical basis, also, for devising 
satisfactory tests of g We need not, like Binet, choose tests or 
items which appear to involve judgment (or whatever we think 
intelhgence consists of). Instead tests are taken which have been 
proved, by correlational analysis, to have high i^-loadings Each of 
these tests will have some specific content, but as these s-factors 
are, by definition, independent, when we combine several tests 
the various ^’s will tend to cancel out, leaving us with a purer 
measure of g. 

The Nature of g and of Specific Factors. Although Spear- 
man wisely refused to identify g with intelligence or any other 
quality whose definition was controversial, he suggested that it 
depends on the general mental energy with which each individual 
is endowed. The ^-factors he compared to a large number of 
mechanisms or engines, which could be activated by this energy. 
They are largely affected by education and training, whereas g is 
innate and ineducable. By studying the tests with high or low g- 
saturations he concluded that the outstanding psychological char- 
acteristic of the former is that they involve seeing relationships, or 
to use his own terms, the eduction of relations and correlates. In 
answering, say, an arithmetic problem, the pupil has to grasp the 
relations between the various data presented, and to deduce some- 
thing new in order to reach an answer which bears the correct rela- 
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tions to these data. By contrast, if the pupil is asked merely to 
repeat the multiplication tables which he has learnt by rote, no 
new relationships are involved. Hence the ^-saturation of the 
former task is high, of the latter low. 

Thomson’s Criticisms of Spearman. Some of Spearman’s 
statistical techniques were strongly criticized by Thomson, and 
he argued that the two-factor theory was a possible, but not a 
necessary, inference from the statistical results (Brown and 
Thomson, 1921). The tendency towards positive correlation and 
zero tetrad differences could equally well be explained if abilities 
depend on a very large number of small causes in the mind (cf. 
Theory of Bonds, p. 31). 

Spearman’s Neglect of Group Factors. The chief criticism 
that would be raised nowadays against Spearman’s views is that he 
failed to allow sufficiently for types of ability which, while less 
general than g, are certainly not specific. He admitted indeed that 
different number tests, also different mechanical, and certain other 
types of test, show residual correlations over and above g. But he 
ascribed this to the presence of common specific factors and in- 
sisted that such 'specific overlap’ is very rare. Actually the notion 
of specific overlap is a contradiction in terms, and towards the end 
of his life Spearman did begin to recognize the existence of broad 
group factors such as the verbal and spatial, which arise from the 
overlapping of quite diverse ^-factors. One reason why his own 
work, up to 1927, failed to yield evidence of group factors was that 
he and his followers were seldom able to test large populations. 
Hence any residual overlap that did appear was usually not 
statistically significant; it might have arisen from chance errors in 
the correlations. But Spearman was unduly cautious and did not 
admit that lack of statistical significance does not disprove the 
existence of additional factors; it only fails to prove it. A large- 
scale experiment was carried out by Brown and Stephenson (1933) 
with the avowed object of demonstrating the truth or falsity of the 
two-factor theory Three hundred 10-year boys were given 
twenty varied tests Some of the pairs of tests did in fact show 
correlation beyond that accounted for by g. But the authors 
attributed this to specific overlap, and on eliminating the disturb- 
ing elements they were naturally able to prove that g was the sole 
factor present. Some years later Blakey (1940) re-analysed the 
correlations by Thurstone’s method, without omitting any of the 
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awkward overlap, and concluded that verbal, perceptual and 
spatial group factors were present, though their variance amounted 
only to 12.9 per cent, as compared with 41.2 per cent attributable 
tog 

It is noteworthy that if Spearman’s strict view was correct, 
educational or vocational guidance with the aid of tests would be 
impossible. We could not measure aptitude for linguistic or 
mechanical work by linguistic or mechanical tests, since both 
types of test would predict nothing but g. In fact the only tests 
worth using would be the purest g ones. By means of these we 
could determine the general level of occupation or education for 
which an individual was suited, but could not differentiate between 
different types of ability at this level. The only possibility would 
be to apply tests covering the specific factors in each prospective 
job. Thus an assembly test might measure the ^-component of 
mechanical assembly work, but would throw no hght on aptitude 
for lathe operating or other mechanical jobs 

In pomt of fact Spearman has proved much more nearly right 
than vocational and educational psychologists would wish him to 
be. We shall see later that group factors are generally more limited 
in scope than general, and highly specific, ones, so that it is indeed 
very difficult to differentiate types of aptitude. 

Burt’s Analysis of Scholastic Attainments. As early as 1909 
Burt had obtained suggestive evidence of a sensory discrimination 
group factor beyond g, and in subsequent years he explored the 
fields of imagery, temperament and scholastic attainments. His 
memorandum on The Distribution and Relations of Educational 
Abilities (1917) was a landmark since it provided clear evidence 
(which Spearman continued to ignore) of verbal, numerical and 
practical group factors in school subjects,^ in addition to a general 
factor. Also he arrived at the fundamental formula for the Simple 
Summation technique of analysis, later rediscovered by Thurstone 
and named the Centroid method, and developed techniques of 
assessing group factors. The verbal factor appeared to be two-fold, 
one part including the more complex or literary subjects — Com- 
position, History, Geography and Science, the other including the 

^Slightly earlier (1915-16), Carey mter-correlated the school examination 
marks of up to 500 children, and found a distmctive practical factor in Wntmg, 
Painting and Needlework There were mdications also of a verbal factor m 
Composition, Reading and Spellmg, but Geography, Science, History and 
Anthmetic appeared to depend only on the general factor. 
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simpler word-reading and spelling attainments. The practical 
group included Handwork, Drawing, Writing Quality and Speed. 
Substantially similar results were obtained with 613 ten-year 
children in 1939, except that the two types of verbal ability 
appeared to have amalgamated. For the average school subject 
the variance attributable to the general factor was 27*9 per cent., 
and to group factors 20-7 per cent. Another interesting point was 
that the general factor correlated highly, but not perfectly, with 
an intelligence test. This suggested that general scholastic ability 
is largely made up of g, but involves in addition such qualities as 
interest and industry. 

Kelley’s Crossroads in the Mmd of Man (1928). In America, 
Kelley studied the inter-correlations of batteries of tests given to 
three groups of over a hundred pupils, aged around 13, 9 and 
3^-6 years. By means of an elaborate and rather difficult technique 
which has seldom been used since, ^ he established much the same 
pattern of verbal, number, rote memory, spatial, and speed factors 
at each level. The general factor was still the most prominent in 
all groups, but Kelley accorded it a much less important role than 
Spearman, interpreting it as heterogeneity due to differences in age 
or maturity, race, nurture, sex, etc. 

The Minnesota Study of Mechanical Ability. This investi- 
gation by Paterson and Elliot (1930) represents another assault on 
the two-factor theory. The finding that the average correlation 
between some twenty-six tests applied to 13 -year boys was only 
-f - T7 is often cited to show that mechanical and motor abilities are 
highly specific. But actually the low correlations occurred chiefly 
among physical tests such as Dynamometer, Steadiness and 
Agility, and among certain questionnaires or assessments of inter- 
ests, which are likely to be unreliable at this age. Table X (p. 102) 
shows considerable overlapping among the twelve more important 
measures in the main experiment with 100 boys. What the in- 
vestigators did show (cf. Carter, 1928) was that a single general 
factor as defined by Spearman, and substantiated by the tetrad 
difference or other techniques, does not fit their results. But they 
admitted the presence of group factors, though they did not com- 
mit themselves as to their nature. They claimed also that mechani- 
cal ability or abihties are almost independent of g, but then they 
were considering only a single verbal group test which, according 

^ Except by McDonough (1929) 
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to later work, probably involves almost as much v as g. Further 
their group is likely to have been fairly highly selected, and we 
shall see below that ^-saturations are always reduced when all the 
testees are high (or all low) in g Further deductions from these 
figures are given on p. 101/. 

Group Factors Established by Stephenson and El Koussy. 

Two more investigations employing Spearmanian techniques 
deserve special mention. In 1931, Stephenson gave seven verbal 
and eight non-verbal intelligence tests to 1,037 girls, aged around 
ten to twelve. Correlations between the non-verbal tests could be 
accounted for by a single factor, which he identified with g. The 
verbal tests were more complex, but their correlations with one 
another and with the non-verbal tests, could be accounted for by 
g and a verbal group factor. It should be pointed out, however, 
that Stephenson’s results do not disprove the alternative of 
another group factor of a spatial-perceptual nature m the non- 
verbal tests — that is a structure similar to that of Table IV. In 
terms of variances (roughly calculated by the present writer) 
Stephenson’s solution was: 

g k V Communality 
Average non-verbal test 38% 0% 0% 38% 

Average verbal test 36% 0% 13% 49% 

A solution which would be more favoured nowadays, and which 
maintains the same communalities, would be: 

g k V 

Average non-verbal test 31% 7% 0% 

Average verbal test 44% 0% 5% 

The symbol k for the spatial factor was first applied by El 
Koussy (1935) who gave twenty-six tests to 162 boys aged 11 
to 13. He showed by tetrad analysis that eight of these obtained 
loadings on such a factor with about the same variance as their 
^-loadings. According to introspective evidence all these tests 
seemed to require visual imagery for their successful solution. 
Other tests employing visual material, together with Cox’s 
Mechanical Explanations and Completion (i.e. mechanical com- 
prehension) tests, and school marks in woodwork and drawing, 
gave only low correlations with this factor (cf. p. 66). 
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Thurstone’s Multiple Factor Analysis. In 1931 Thurstone 
developed the centroid technique of analysis and applied it to 
measures of attitudes and to ratings of personality traits, where it 
was natural to expect — not a general factor and small subsidiary 
group factors — but a number of components of more nearly equal 
variance. The diflerences between two-factor, group-factor and 
Thurstonian multiple-factor analyses may be illustrated by the 
diagrams in Fig. 1, According to the third diagram, no factor runs 
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Fig. 1, Two-Factor, Group-Factor and Multiple-Factor Analyses 


through all the tests. Each covers a different, though often over- 
lapping, set of tests. Thus the content of some tests can be as- 
cribed to one factor only, while others show significant loadings on 
two or even three factors. Note that the blank entries in the 
diagram are not usually zero loadings, but are so small that they 
can be attributed to chance. In a group-factor analysis, however, 
every test has a general factor loading and a loading on one (or 
occasionally more than one) group factor. In each type of analysis, 
every test shows its own specific factor. 

Alexander’s Investigation. Apparently the first application 
of Thur^tone’s method to abilities was that of Alexander (1935), 
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who gave large batteries of verbal and non-verbal intelligence 
tests and certain performance tests to groups of about one 
hundred Scottish primary school boys and girls (11-12 years), 
American secondary and technical school pupils (16-17), and adult 
women in a delinquent institution For the technical school group 
he also had school exammation marks. Actually the multiple 
factors that he obtained conformed quite closely to a group factor 
pattern. Thus in addition to there was a ‘v factor in the verbal 
tests, and some of the more complex and constructive performance 
tests gave a practical group factor, which he called F, It was on 
the basis of these results that Alexander developed his performance 
test scale, consisting of Cube Construction, Kohs Blocks and 
Passalong, for measuring ‘concrete’ or practical abihty. Another 
important finding in the third group was that the measures of 
school attainment showed a separate group factor of their own, 
thus confirming Burt’s results, mentioned above. He called this 
factor X, and identified it, very plausibly, with the influence of 
personahty and interests, i.e. with something in the nature of 
industriousness which affects all school work.^ 

Thurstone’s Primary Mental Abilities. In 1938 Thurstone 
published the first of his long series of investigations of human 
abilities, namely an analysis of fifty-six tests given to 240 college 
students. This seemed to reveal a complete break with Spearman, 
since there was no g at all, but — much as in the personality field — a 
series of distinct multiple factors. The eight main or primary 
factors were identified as: 

V Verbal P Perceptual Speed 1 Inductive Reasoning 

AT Number M Rote Memory D Deductive Reasoning 
W Word Fluency S Space or Visualization 

Note that American multiple factors are usually assigned 
capital letters, while British g and group factors receive small 
letters. Though their content and derivation are very different, 
the status of these primary factors is closely similar to that of 
nineteenth-century faculties, against which Spearman had battled 

^ Niimerous previous 'writers had, of course, interpreted discrepancies be- 
tween I.Q. and E.Q. as due to emotional influences. In effect the X factor is 
much the same as the A.Q. — ^the accomphshment or achievement quotient. 
Alexander claimed yet another factor, Zy of doubtful rehabihty, which has smce 
been identifled by Yela (1949), after re-analysis, with Thurstone^s or Meih*s 
perceptual synthesis factors (cf. pp. 58, 89). 
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for over thirty years. Spearman (1939) was quick to point out that, 
as all Thurstone's tests were positively inter-correlated, they could 
equally well be analysed to yield a large general factor and smaller 
group factors. Such alternative analyses of Thurstone’s figures 
were, m fact, carried out by Holzinger and Harman (1938) and 
Eysenck (1939). The latter’s g obtained a variance of 30*8 per 
cent., and the combined group factors 23 • 5 per cent The content 
of the group factors corresponded quite closely to that of 
Thurstone’s primary factors, just as in Fig. 1/3 the multiple 
fectors A, B and C cover much the same tests as the group 
factors of Fig. 1/2. Although Thurstone’s solution of the 
factors underlying the tests is as legitimate mathematically as a 
general + group factor solution, he has not disproved the 
existence of a g. In effect he has divided it up among his seven 
group factors. The arguments for and against these alternatives 
(which are much less irreconcilable than might appear at first 
sight) are examined in the Appendix. 

Subsequent American Work. Almost all factorial psycho- 
logists in America^ have followed Thurstone’s lead, and their 
results, like his, can readily be fitted into the picture of mental 
structure advocated in this book. With their vastly greater re- 
sources for applying huge batteries of tests to large groups, and for 
doing most of the donkey-work of calculation by machine, it is 
natural that they should be responsible for most of the advances of 
recent years in this field, 

Thurstone, aided by his wife and students, have greatly extended 
the above investigation. Several of the primary factors have been 
studied m more detail, by analysing the original tests for a factor 
along with others which helped to define it more accurately, or 
showed how it could be sub-divided. And comprehensive batter- 
ies, similar to the original one, have been given to high-school and 
younger pupils, even to 5-6 year olds. The results at differ- 
ent ages have been remarkably concordant. Thus the first six 
factors in the above list were clearly identified when sixty tests 
were analysed among 710 pupils aged around 14 years (Thurstone, 
L. L. and T. G., 1941), though Deduction disappeared and In- 
duction seemed to be better named R (Reasoning). P was also 
somewhat unstable, and has been omitted from the battery of 

^ With the exception of Holzmger, and R. B. Cattell, the latter a pupil of 
Spearman. 
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Primary Mental Abilities tests issued for measuring these factors. 
One noteworthy difference, however, is that the primary factors 
among children tend to be less independent Since they correlate 
with one another moderately highly, they themselves can be 
analysed in the same way as tests are analysed, and they usually 
reveal a kind of super-factor, or what Thurstone calls a second- 
order general factor.^ Though he does not go so far as to identify 
this with gy he admits that it constitutes a bridge between his own 
and Spearman’s viewpoints. He now describes primary factors as 
‘facilities’ in the mind, or ‘media of expression’, and regards 
second-order factors (of which g may be one) as more central 
(Thurstone, L. L., 1948). This theory is strongly reminiscent of 
Spearman’s general energy and specific engines. 

The U.S.E.S. Investigations. In 1945 there appeared the first 
report on large-scale researches by the United States Employment 
Service, Division of Occupational Analysis, into the development 
of a set of differential aptitude tests (cf. Staff, Division of Occupa- 
tional Analysis, 1945). Various battenes of about twenty tests 
(fifty-nine in all) were given to nine fairly large and representative 
groups of adult applicants for employment, totalling 2,156 per- 
sons. On* analysis, the most stable or consistent factors, which 
recurred in most of the groups were: 

V Verbal P Perceptual T Motor Speed 

N Number Q Clerical F Finger Dexterity 

S Space L Logic M Manual Dexterity 

A Aiming 

also a general factor. 

The best tests for measuring each factor have been issued as a 
battery, lasting about three hours, and from a testee’s profile or 
pattern of factor scores, it is hoped to predict the type of occupation 
for which his aptitudes fit him (Dvorak, 1947). 

Factorial Studies in the U.S.A.A.F. During the 1939-45 war, 
large-scale testing of recruits occurred in Britain and America, and 
factor analyses were often carried out on populations of a thousand 
or more. Particularly extensive use of the technique was made by 
Guilford and his collaborators in the U.S. Army Air Force (Guil- 
ford 1948ab, Guilford and Lacey 1947, Davis 1947, Melton 1947). 
Studies of the job of the pilot and of other air-crew personnel 

^ The mathematics of oblique (i.e. correlated) factors are clearly described m 
the later editions of Thomson’s textbook (first pubhshed 1939). 
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suggested what abilities might be worth testing. Elaborate and ex- 
tremely ingenious tests for measuring each of these were devised, 
and factorization was applied to discover which were consistent 
and distinctive. By this means pilot aptitude itself was largely 
broken down into factors objectively definable by appropriate test 
batteries, instead of into subjectively determined qualities. More 
then twenty factors are claimed : 


Carefulness 

Integration I, II 
and III 

Length Estimation 

Memory I, II and 
III 

Mathematical In- 
terest and 
Training 


Mechanical Infor- 
mation 

Perceptual Speed 

Pilot Interest 

Planning Ability 

Psychomotor Co- 
ordination 
Psychomotor Pre- 
cision 


Psychomotor Speed 

Reasoning I, II and 
III 

Spatial Relations I, 
II and III 

Social Science Inter- 
est and Training 

Verbal 


Visualization 


Factorial Studies in the British Services: Hierarchical 
Group Factor Theory. In this country, where most work was 
done on less selected samples of the population such as Navy and 
Army conscript recruits, the importance of g was amply confirmed 
(Vernon, 1947a). In eight analyses, g was found to cover more than 
twice as much variance as all group factors combined. Table V 
shows an analysis of thirteen tests given to 1,000 Army recruits. 
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SpeciFic factors 


Fig. 2. Diagram illustrating Hierarchical Structure of Human Abilities 
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and brings out a feature which appears to be highly characteristic 
of mental structure, namely hierarchy. After the removal of g, 
tests tend to fall into two main groups: the verbal-numerical- 
educational on the one hand (referred to as v:ed factor), and the 
practical-mechanical-spatial-physical on the other hand (referred 
to as him factor) If the analysis is sufficiently detailed, i.e. if 
sufficient tests are included, these types themselves sub-divide* 
The vied factor in Table V gives minor v and n (number) group 
factors. In other analyses (e.g Table IX), kim splits similarly into 
mechanical information, spatial, and manual subfactors. Thus a 
first approximation to mental structure is provided by the hier- 
archical diagram of Fig 2, resembling a genealogical tree Its 
advantages and limitations form the subject of the next chapter. 


TABLE V. SIMPLE SUMMATION AND GROUP FACTOR ANALYSES 
OF TESTS GIVEN TO 1,000 ARMY RECRUITS 


Tests 

Unrotated Centroid Fac 
I II III IV 

tors 

ha 

Group Factors 
g k m ed. V n 

h* 

0 Progressive 
Matrices 

'll + 23 +*10 — 16 

•68 

•79 -17 

•65 

Dominoes 

(non-verbal) 

•80 +-09 +*19 — 12 

•70 

•87 

•75 

Group Test 
70, Pt I 

•74 +-16 + 03 — 08 

•58 

•78 -13 

•62 

4 Squares 

•63 +-35 — 00 +-01 

•52 

•59 -44 

•54 

8 Assembly 

•37 +-54 — 15 +-28 

•52 

•24 -89 

•85 

2 Bennett 
Mechanical 

•69 +-33 — 17 +-07 

•62 

•66 -31 

•54 

25 Verbal ' 

•88 —24 — 26 — 14 

•92 

•79 -29 -45 

j -90 

Dictation 

•79 — 42 — 25 — 11 

•88 

•62 -54 -48 

I 90 

14 A.T.S. 
Spelling 

•81 — 32 — 20 — 11 

•80 

*68 -41 -43 

•82 

21 Instructions 

•89 -—06 + 11 — 15 

•82 

•87 -23 -09 

•82 

3a Arithmetic, 
Pt. I 

•84 —-29 + 22 +-23 

•89 

•72 49 -39 

•91 

Arithmetic, 
Pt 11 

86 — 16 +-12 +-13 

•80 

•80 -38 -16 

•82 

23 A T.S. 
Arithmetic 

•84 — 21 +*26 +-14 

•84 

•77 36 *32 

•82 

Variance 
per cent. 

59 8 8 5 3*1 2-2 

73-5 

52 5 8-7 8-4 6 9 

76-5 


The Relations Between Group-Factor and Multiple-Factor 
Analyses. Table V also serves to illustrate some of the resem- 
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blances and differences of centroid and group-factor analyses. In a 
centroid analysis the first factor represents the highest common 
element in all the tests It is not usually the same as g, but is a kind 
of average of the particular tests applied in the investigation. 
Subsequent factors, II, III and IV, are known as bipolar, since 
roughly half the tests receive positive, half negative, signs. These 
successively divide the tests into contrasted groups, and although 
they may have no psychological meaning as they stand, yet they do 
usually reveal what group factors are present. Often this classifica- 
tion by bipolar factors tells us all we need to know, and in several 
examples in subsequent chapters the original, or unrotated, 
centroid factors alone are quoted. But it is preferable to transform 
the first and the bipolar factors into a series of factors where all the 
tests have either positive loadings, or zero or insignificant negative 
loadings, by means of what is called rotation of axes. This of 
course redistributes much of the variance of the first factor among 
the remaining ones. Actually the aim of rotation is to maximize the 
number of zero or insignificant loadings on each factor, so that as 
much as possible of the variance of each test is confined to a single 
factor. Thurstone calls this Simple Structure, Often such rotation 
does yield a general factor running through most or all of the tests, 
and smaller factors each confined to a few tests, in other words a 
group-factor pattern. But true group-factor analysis is carried out 
by assessing ^-loadings first, and then analysing the residual cor- 
relations in each group of tests, as in Tables I-IV. A clear and 
much fuller account of these different types of analysis, and their 
inter-convertibility, is given by Burt (1944; cf. also 1938, 1939a, 
1940a, 1949). 

Other Methods of Analysis. This historical r&ume must not 
omit to mention, however briefly, certain other approaches to 
factor analysis which have been less widely applied than general 
or group factor, and simple summation or centroid, methods. 
Broadly speaking they are more accurate mathematically, but do 
not provide appreciably more psychological information about the 
make-up of the analysed tests, which would compensate for their 
much greater complexity and tediousness of calculation. They 
include Burt’s Weighted Summation, Lawley’s Maximum Likeli- 
hood, Hotelling’s Principal Components, and Kelley’s Principal 
Axes, methods. Explanations may be found in Thomson’s and 
Burt’s textbooks. 



CHAPTER III 


HIERARCHICAL GROUP-FACTOR THEORY OF THE 
STRUCTURE OF ABILITIES 

Abstract, The strict hierarchical picture of mental structure is 
an over-simplification. For the results of any factor analysis 
depend largely on the composition of the population tested (e.g. 
its degree of selection), and on the number and kind of tests 
studied. Since by choosing suitable tests almost any specific factor 
can be turned into a group factor, it is suggested that only those 
group factors shown to have significant practical value in daily life 
are worth incorporating in the picture. It is doubtful whether 
group factors differentiate merely as a result of ageing or mental 
growth. Rather, their pattern or structure changes according to 
the type of education and training. Thomson’s theory of bonds 
gives a useful explanation of ^ and of ability group factors, but the 
influence of temperament and personality, physique, sex, age, 
interests, etc. should be taken into account. The fallacies of the 
layman’s conception of theoretical vs, practical, and other opposed 
types of abilities are discussed. 

The Hierarchical Diagram Should be Regarded Only as an 
Approximation. The hierarchical theory which was outlined at 
the end of the preceding chapter, was first put forward by Burt, 
under the influence of McDougalL In a recent article (1949) Burt 
describes how it originated, and shows that it applies in the fields 
of temperament and of anthropometric measurements, as well as 
to abilities. Though it is certamly an improvement both on the 
original two-factor theory and on the ‘neo-faculty’ theory of 
American writers, it has numerous hmitations and imphcations 
which we must now discuss. The more technical arguments for 
preferring it to the American viewpoint are given in the Appendix. 

A diagram such as Fig. 2 would be obtained only if an extensive 
battery of tests, covering — or at least samphng — most of the varie- 
ties of human abilities, could be applied to a very large and repre- 
sentative sample of the population. With one or two hundred 
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testees the correlations are usually too unreliable for more than 
two to four group factors to be established at a time. In general a 
minimum of three tests is needed to define a factor, hence only a 
few factors can be resolved in any one investigation with a limited 
battery of tests. Further, if such a battery consists only, or pre- 
dominantly, of a specialized type of test (e.g, all tests of sensory- 
motor abilities), the g and major group factors may fail to reveal 
themselves. The diagram is, in other words, a hypothetical inte- 
gration of all the factorial investigations that have been carried 
out, rather than an established fact. It is considerably expanded, 
and corrected, in Figs 3, 4, 5 and 7. 

Broad and Narrow Group Factors. Nevertheless there is 
ample evidence to support the view that group factors are almost 
infinitely subdivisible, depending only on the degree of detail to 
which the analysis is carried. Indeed, by including sufficiently 
similar tests, any specific factor (in Spearman’s sense) can become 
a group factor. The only truly specific element is the unreliability 
or error variance of the test. Thus in a complete factorial investi- 
gation the commonality of each test should approximate to its 
reliability coefficient. It is even possible, when analysing special- 
ized tests, for a specific factor to become a general one. For 
example, a reaction time test analysed in a battery of paper-and- 
pencil mental tests might obtain a g and major group factor 
variance of about 10 per cent., specificity 90 per cent. On includ- 
ing two other kinds of reaction time, a small group factor would 
appear, while in a battery consisting only of such tests, a general 
reaction time factor with 30 per cent, or higher variance might be 
found, and we should be unaware that this was composed partly of 
‘higher’ factors such as g and him. Thus there is no absolute dis- 
tinction between general and specific factors as Spearman believed. 

An important problem, as yet unsolved, is how broad a group 
factor should be before it is accepted as a useful element in our 
picture of mental structure. Some of the primary factors of Thur- 
stone, Guilford and others — rote memory, for example — are so 
narrow in content that it may be more harmful than helpful to 
name them. It is unlikely that the rote memory tests which make 
up Thurstone’s M factor have any predictive value for the rote 
memory in which teachers are interested (cf. Carroll, 1943; 
Wittenborn and Larsen, 1944). Only if it were possible to estab- 
lish a common factor (over and above g and v:ed) in rote memory 
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tests and in the learning of spellings, multiplication tables and 
poetry, would the notion of a rote memory group factor be accept- 
able. The same stricture holds for most of the manual dexterity, 
sensory-motor, and co-ordination factors that have so far been pro- 
posed. At the present moment the writer cannot think of any ob- 
jective basis for distinguishing between acceptable group factors, 
and narrow factors confined to the highly specialized types of test 
which psychologists delight in constructing. But he would suggest 
that factors which fail to contribute at least 5 per cent, to the 
variance of some measure of educational or occupational pro- 
ficiency or other capacity in daily life should be relegated to the 
latter category. If, for example, g and v tests alone predict the 
ability to learn and retain poetry to the extent of a correlation of 
60 (i.e. a variance of 36 per cent.), then the addition of tests of 
rote memory to the predictive battery should raise the correlation 
to at least * 64 (variance 41 per cent ) if the factor measured by 
these tests is to be acceptable,^ Such a criterion involves sub- 
jective judgment as to what constitutes a ‘capacity in daily life’, 
and is beset with many difficulties. But it appears preferable to a 
judgment of the broadness vs narrowness of the tests which yield a 
distinctive factor.* 

Relative Importance of Factors at DiflFerent Levels, The 

hierarchical group-factor viewpoint implies that most of the vari- 
ance of human abilities in daily life is attributable to g and to 
highly specific (or very small group) factors, and that the role of 
the broader group factors is rather meagre. If our diagram could 
be worked out completely to cover all human abilities, the g- 
variance might amount to about 40 per cent., the major and minor 

^ This suggestion recalls Thomson*s (1939) argument that factor analysis is 
of httle use in vocational or educational psychology, because the predictive value 
of tests can be estabhshed much more efficiently by multiple correlation tech- 
nique The writer would agree that the content of a test as determmed by factor 
analysis at the present time often fails to reveal its true predictive value, because 
the test’s specificity may embody other group factors which are particularly 
relevant or irrelevant to some job. For example a test of graph-readmg is very 
useful m selecting radar operators, but when analysed it usually appears to con- 
sist purely of g + w 4- specificity. More detailed analysis would how- 
ever break down part of the specificity mto a mmor group factor for graph- 
reading, 1 e a sub-division of n This is the hne that Guilford and Lacey fol- 
lowed m the U.S A.A F. After considerable experience of multiple correlation, 
the writer has come to the conclusion (Vernon and Parry, 1949) that it is much 
too efficient It does not, like the factorial approach suggested m this note, 
sufficiently allow for the chance errors m the vahdity coefficients of selection 
tests, and prohibits the mclusion of two or more rather similar tests m order to 
improve the rehabihty of prediction. 
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group factors to some 10 per cent, each, and the remaining 
40 per cent, would consist of very narrow group factors and un- 
reliability This means that, fairly good predictions of ability in 
education, industry, or everyday life, can be achieved by g tests 
alone, and that somewhat more ground can be covered by tests of 
the main group factors. But only by much more detailed experi- 
mentation on tests relevant to particular jobs, or by work-sample 
methods (i.e. trying candidates out on the actual work), can much 
more than SO per cent accuracy be obtained. This explains why 
Stanford-Binet or Terman-Mernll I.Q., or all-round intelligence 
as measured by reliable group tests, have considerable practical 
value both among children and adults, whereas more specialized 
tests add something but not very much in educational and voca- 
tional guidance. 

Effects of Range of Ability and Selection on Factor Patterns. 

However, these quantitative estimates of the importance of different 
‘levels’ of factors are dangerous, since so much depends on the 
degree of heterogeneity of the people concerned. When the same 
tests which, among unselected recruits, gave g and group-factor 
variances of 50 per cent, and 20-25 per cent., respectively, were 
analysed among high-grade mechanics or officer candidates, g 
often fell to IS per cent, and group factors rose to 35 per cent. 
Similarly it has often been observed that the g and k:m content of 
manual dexterity tests sinks almost to zero among such selected 
groups as technical school pupils or college students; such tests no 
longer correlate at all with paper-and-pencil tests of mental 
abilities. Since the people receiving any special type of education 
(e.g. grammar school or university), and those engaged in any one 
job, almost always constitute a strongly selected group, there is 
usually more scope for tests of major and minor group factors and 
less for g tests, than the preceding paragraph suggests. (The 
relatively unpredictable portion is however generally larger among 
such groups.) It also follows from Thomson’s (1939) demon- 
stration of the effects of selection on factors that not only the 
general degree but also the type of selection is important. For 
example, if a battery of mechanical and other tests is applied to a 
group of men with engineering background, and to another group 
with the same range of g but without experience, the factor pat- 
terns are liable to differ (cf. Chapter X). This accounts for many of 
the discrepancies between the results of different investigators, and 
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makes it extremely difficult to fill out the details of our diagram 
accurately. We cannot expect to reach a final and complete map 
of the structure of abilities, since it necessarily varies with the kind 
of population studied. 

Differentiation of Specialized Abilities with Age. The rela- 
tive prominence of g has often been thought to depend considerably 
on age. The writer at one time accepted the view that g tends to 
differentiate into more specialized abilities during adolescence and 
early adulthood. This view is advocated by Garrett (1946), who 
summarizes several confirmatory investigations. But most of these 
compare college with high school, or high school with elementary 
school, populations, hence the smaller ^-variance in the older groups 
may be due merely to their greater selectivity. 

Clark (1944) did choose groups of eleven-, thirteen- and fifteen- 
year pupils with the same distribution of group test I.Q.s, and 
found a decline in the average inter-correlation of the Primary 
Mental Abilities tests from *488 to -393. Other studies such as 
those of Swineford (1947), Reichard (1944) and Doppelt (1949) 
fail to support the theory. Anastasi (1936) summarizes a large 
number of investigations and shows that, though there are strong 
indications of alterations in factor patterns with age and training, 
the evidence for differentiation is far from unanimous. McNemar 
(1942b) carried out fourteen factorizations of Terman-Merrill scale 
items at mental age levels ranging from 2 years to 18 years. 
His results are irregular and show no sign of any consistent trend 
towards greater differentiation at later ages. This might be 
criticized on the grounds that the later items are less diverse in 
content than those for young children. However Balinsky (1941) 
factorized an identical battery of tests, namely the Wechsler- 
Bellevue scale, among groups aged 9, 12, 15, 25-9, 35-44 
and 50-59, all with average I.Q. 100. He obtained the following 
first factor variances: 38, 36, 24, 20, 33 and 45. These suggest 
differentiation from 9 to 30 and then greater integration. But 
he neglected to ensure the same degree of heterogeneity among 
the testees at all ages, and when correction is made for this, his 
first factor variances show much the same irregularity and lack of 
any clear trend as McNemar's.^ 


^ It is difficult to see how a crucial investigation of this problem could be 
planned even if stnctly random samples could be tested at several age levels. 
Fpr as Emmett (1949) pomts out, the content of the tests should be equally 
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Particularly striking are Williams's (1948) results from the ap- 
plication of the same battery of ten intelligence, spatial and me- 
chanical tests to samples of 250 boys, carefully chosen to be 
representative, at the ages of 12, 13 and 14. Here the first factor 
variances were 51, 56 and 62 per cent., respectively, indicating that 
secondary education tends to produce greater integration, not 
specialization, of verbal and practical abilities. In a research by 
the writer the standard British naval battery of five tests was given 
to 1,171 boys leaving school at 14, and the results were compared 
with those of 265 seamen recruits who had also left at 14 in the 
same district some four years previously. Scores tended to rise 
with age on the spatial and mechanical tests, and to drop on the 
arithmetical ones (cf. Vernon and Parry, 1949), but the average 
inter-correlation and ^-saturations were almost identical. The only 
significant change was a drop in the correlation of mechanical 
arithmetic with mathematics (from *642 to *379), The correlation 
of k:m tests with these educational tests were slightly lower, but 
with a verbal reasoning test (mainly a measure of g) they were 
slightly higher. Similarly, two parallel groups of about 240 naval 
artificer apprentices aged 15+ and 18+ took nine varied tests, and 
most of the correlations were slightly higher in the older group, 
although the nature of the group factors altered considerably 
(cf. p. 116). 

Another approach to this problem was to calculate tctrachoric 
inter-correlations among ten tests, first by contrasting the top 
25 per cent, of a group of 993 Army recruits with the bottom 75 
per cent., secondly contrasting the top 75 per cent, with the 
bottom 25 per cent. If abilities are more differentiated among the 
more able and intellectually mature, the former correlations 
should be lower than the latter, but actually the reverse occurred. 
Many more men were very high on all tests than were very low on 
all tests, and the first factor variances were 75 • 6 and 66 • 3 per cent., 
respectively. 

From these and other bits of evidence the writer would conclude 
that there is no general tendency towards differentiation, except 
perhaps in early infancy, and that everything depends on the type 
of educational and vocational training. Usually when abilities are 


appropriate at all ages, otherwise reliability is affected, and with it the first " 
factor variance. This condition may have disturbed both Williams's and the 
writer’s results reported in the next paragraph. 
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practised at school or in jobs they tend to become more specialized, 
though sometimes the teaching is of such a nature as to increase 
integration. Again regression or de-differentiation may often 
occur as the effects of past training wear off. It is conceivable that 
secondary schooling is more fragmentary in America than in 
Britain, and so apt to produce more differentiation between 12 
and 18 than is usual here. But undoubtedly the mam reason for 
the apparent reduction in the importance of g m adults is that the 
testees are more homogeneous in ability. 

It is because the majority of American investigations are con- 
ducted with college students, aircraft pilots, high-school pupils and 
other selected groups, that their results so readily fall into inde- 
pendent primary factors, instead of ^ and group factors. But when 
more heterogeneous adult groups have been studied, a g has 
usually appeared. Thus Anastasi (1948) quotes American Army 
studies which showed almost as high correlations between verbal, 
numerical, spatial and mechanical tests as between different 
numerical, or different mechanical tests. An analysis of the U.S. 
Navy battery gave zg with variance of over 30 per cent., together 
with smaller mechamcal, spatial and educational group factors 
(Staff, Test and Research Section, 1945). The emergence of a^ 
in the Division of Occupational Analysis’s investigations has 
already been mentioned 

Psychological Nature of Factors- We must next consider the 
nature of g and the group factors a little more closely. Thomson 
has shown that the statistical fact that test inter-correlations can 
be largely accounted for by a single factor does not prove that such 
a factor represents any unitary power, or organ of the mind. It 
nought also arise if the mind is thought to consist of an immense 
number of ‘bonds’, including inherited reflexes, acquired habits 
and associations, etc. A person’s performance at any one test 
would involve the activation of a large number of such bonds, and 
if a miscellaneous set of tests is given, the extensive sampling of 
bonds would result in the positive correlations that actually tend 
to occur. But he agrees that factors are useful concepts for 
describing the content of the various kinds of samples that may be 
taken, provided that they are not reified into organs or faculties. 
In this book we accept Thomson’s view, and hold that factors over 
and above g arise, partly perhaps from hereditary influences, but 
mainly because an individual’s upbringing and education imposes a 
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certain grouping on his bonds. The v:ed factor is, as we shall see, 
a rather strongly unified group because our society gives a fairly 
uniform education to all its members. It does not readily break 
down into separate verbal, number, speed, reasoning, attention, 
memory or other factors because the abihties covered by these 
names tend to be developed differently in different schools and 
homes, though partially distinct minor group factors can often be 
established, especially in fairly homogeneous groups such as 
university arts students. On the practical or k:m side there is, as 
Anastasi points out, less cultural standardization, hence the k:m 
pole is more heterogeneous and amorphous than v:ed. It would 
appear to be not so much a positive practical ability as an aggregate 
of all non-symbohc capacities, or of bonds that are not usually 
affected by primary schooling. Nevertheless, evidence is given 
below that not only mechanical and spatial, but physical and 
manual, and some non-verbal g, perceptual and performance tests 
all have something in common over and above g. The kind of test 
which is most strongly saturated with this factor is the mechanical 
assembly test, presumably because this epitomizes, as it were, 
non-scholastic activities. A rather significant point is that boys 
and men (at least in Western European civilization) tend to sur- 
pass girls and women on most aspects oik:m, whereas females tend 
to be superior in the linguistic aspects of v:ed. Though this might 
be attributable to cultural norms, it suggests the operation of here- 
ditary influences. Thus it cannot be due to upbringing that girl 
babies usually start to talk earlier, or that boys and men have on 
the average greater physical size and strength. 

Modifications of the Hierarchical Picture. It follows that 
there is no need to regard the hierarchical or genealogical principle 
as pre-eminent. Minor group factors are not always ‘descendants’ 
of either v:ed or k:m. And we shall see later that several factors cut 
across this dichotomous grouping, scientific ability, for example. 
Probably also there are other group factors which split off from g 
but are not subdivisions of either type. Auditory and musical 
abilities seem to constitute one such cluster, and others are men- 
tioned in Chap. VIII. It is merely because v:ed and k:m have the 
widest practical importance, and have received most investigation, 
that they alone are listed in Fig. 2. 

Temperamental and Other Influences. In yet another res- 
pect the diagram over-simplifies matters. It does not allow for 
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personality, physique and other factors which have complex inter- 
actions with ability factors. Physique and physical health con- 
stitute an important dimension (or set of dimensions) which cer- 
tainly affects practical abilities; and physical defects of the senses 
in particular react on educational attainments. Sex influences not 
onlyv.ed~k:m, but also most of the lower-order group factors, such 
as manual, imagery, etc. (Burt and Moore, 1912). Age is important 
in spite of the conclusions reached above. Thus among adults the 
spatial, manual and physical aspects o{k:m tend to dechne, whereas 
specialized mechanical skills and information probably go on in- 
creasing almost to senility (cf. also deterioration, p. 57). Cattell 
(1946) has pointed out that g is somehow associated with such per- 
sonality traits as conscientiousness and with cultured interests. 
Terman’s work on gifted children, and studies of mental defec- 
tives, confirm this. Apparently therefore the bonds established by 
character training, and by the development of sentiments and 
attitudes are linked with the bonds responsible for our cognitive 
or intellectual activities. Doubtless interests greatly affect our 
more speciahzed abilities. It is known also that the fluency factor, 
measured by tests of richness of association with words or pictures, 
is connected with extraverted or cycloid trends, and that such 
physical or manual capacities as visual acuity, dark adaptation, 
agility, and finger dexterities are impaired among neurotics (Slater 
and Slater, 1944; Eysenck, 1947). Again Eysenck finds speed 
accuracy in mental and manual operations to differ among hysteric 
and dysthymic (anxiety or obsessional) neurotics. 

Clearly then we are very far from a complete theory of the 
structure and nature of human abihties, and though it is useful to 
analyse them in isolation as though they were purely cognitive or 
motor, we should not forget that they are abstractions from the 
total personality structure. 

Conclusions Regarding g. Finally it may be seen from 
Thomson’s theory that g is not a fixed, purely inherited, quantity. 
Thomson interprets it as the total number of bonds. Presumably 
this is largely dependent on some psycho-physiological and innate 
property of the higher nervous system, but there is no reason why 
the number should not be affected by the use made of the mind, 
and by organic conditions such as brain injury and ageing. This 
fits in with modem research on the highly individual nature of 
mental growth (cf. Dearborn and Rothney, 1941; Fleming, 1948), 
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on the efiects of schooling and the intellectual stimulus provided 
by people’s jobs (cf Vernon and Parry, 1949), and on deterioration 
of mental efficiency in pathological conditions. 

Compensation Theories. We have considered the relation of 
the hierarchical theory to Spearman’s and Thurstone’s views. 
How does it stand vts-a-vts the popular notion of compensation, 
and of opposed types of people? Actually it admits a large measure 
of truth in the contrast between the theoretical or academic and the 
practical, since they roughly describe our two major group factors. 
But these abilities are not inversely correlated, i.e. opposed, and 
they are independent only when the influence of ^ is ignored. Thus 
in fact the majority of children who are superior educationally are 
also above average in mechanical ability, in doing things with their 
hands, and even in physique, because of the common influence of 
g. The Norwood Report’s separation of academic and practical- 
technical types of children is unsound for several reasons, but 
chiefly because the child with high g who is likely to do well at a 
grammar school would also do well at a technical school (cf. Burt, 
1943). So far, however, we have been talking about average ten- 
dencies; and as correlations between intelligence, educational, 
practical and physical tests are far from perjFect, they admit of 
many exceptional individual cases. Thus there is a proportion of 
children who are suited to grammar school education not because 
their g is very high, but because of strong v:ed factor, and such 
cases may well be poor in k:m factor, and therefore unsuited to 
technical education. There are many cases too who are exception- 
ally well-developed physically, or good at athletics or at certain 
manual skills, whose g + *D:ed are low, who therefore conform to 
the layman’s stereotype. But it is none the less true that such cases 
are much rarer than those who are above average, or below average, 
all round. 

There are several reasons why the compensation theory gains 
such wide credence. First, whenever ^-saturations or correlations 
with educational attainment are low, bright or dull children will 
tend to be nearer the average in non-educational activities. The 
child who is very high in intelligence arid school work will usually 
be less high in manual, practical or artistic work, and the scholasti- 
cally dull pupil will on the average be less retarded in non-scholastic 
fields. This can readily be deduced by studying the scatter for a 
low correlation (cf. Vernon, 1940, p. 154). 
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Secondly, the groups with which we are most familiar above the 
age of 11+ are selected ones, and as already pointed out this 
reduces the ^-loadings and exaggerates the group factors. Hence 
it is quite conceivable that in a grammar school there may be an 
inverse, or at least a neghgible, correlation between, say, mathe- 
matical attainment and football. But if we could study the whole 
range of 15-year pupils we should find that grammar school pupils 
are usually superior to secondary modem pupils not only in 
mathematics, but also at football. 

Thirdly, we are concerned here only with abilities, not with 
interests. The latter probably show much stronger contrasts than 
the former. Thus the adolescent with keen interests in reading or 
other v:ed activities is frequently (perhaps more often than not) 
weakly interested in mechanical or athletic activities, and it may 
be that he devotes so little time to them that his potentially superior 
ability at such activities deteriorates Nevertheless, the university 
professor with his high^ can usually, if put to it, do better at things 
in which he is not much interested such as cooking a dmner and 
washing up without breakmg the crockery, than can a low-^ domes- 
tic servant. And it is by no means fanciful to suggest that the 
victories of the Jews in Palestme in 1948 over the Arabs (who tend 
to be more bellicose in interests) was largely due to their superior 
g and V 'ed, 

‘Slow but sure’ is another popular compensation theory, which 
likewise ignores the influence of g and other factors that tend to 
make the quick worker more rather than less accurate. However, 
It is considered in more detail in Chapter VII, and is shown there 
to possess a modicum of tmth. 

The notion of types of people, as distmct from types of ability, 
should also be discouraged. As Burt (1943) points out, there is no 
more justification for talkmg of an academic or practical type of 
child than for a tall or a short type. Just as the majority are inter- 
mediate m height, so there are many more who are about equally 
able in educational and practical activities than there are extreme 
cases. Ability types themselves are abstractions, smce many abili- 
ties when factorized will be found to be loaded on two or more 
group factors, i.e to be intermediate. But the grouping is more 
clear-cut than in the case of individuals, because it is often imposed 
by school syllabuses and other cultural mstitutions or norms. 

Note that our msistence on g does not involve any denial of 
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special talents m individual cases. Apart from such rarities as 
i^ots savants, there certainly exist children and adults of mediocre 
g and educational attainment who develop outstanding talents in 
the fields of art or scientific invention, or become leaders in busi- 
ness, politics, warfare, etc. Such talents can to some extent be 
attributed to the possession of strong group factors, but per- 
sonality influences, drives and interests are probably still more 
important. The analysis and measurement of such influences by 
psychologists is far less advanced than that of abihties. Thus the 
warning given at the end of Chapter I against regardmg factors as 
covering the whole psychology of human achievements should be 
reiterated. 
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ANALYSES OF EDUCATIONAL ATTAINMENTS 

Abstract. School marks yield a different structure from objective 
psychological tests because of the X-factor — 2 l complex of person- 
ality traits, interests and background. This, together with g and 
v:ed form the major influence in all educational attainments m un- 
selected groups of children and adults, though differentiation 
according to subject-matter can readily be established in selected 
secondary school pupils or university students. The more drilled 
and mechanical aspects of v (verbal) and n (number) abilities 
differentiate most clearly, but there is insufficient evidence to 
justify contrastmg ‘rote’ with ‘reasoning’ attainments. Many a 
priori classifications of types of readmg and number ability lack 
empirical substantiation. For example word-knowledge (voca- 
bulary) and comprehension in readmg come to much the same 
thing. However, mechanical, rate, vocabulary and comprehension 
aspects are partially distinguishable at advanced levels. 

The Industriousness Factor in School Marks. The psycho- 
logist’s V, n and other factors are usually based on tests which are 
fairly pure measures of the abihties at which they are aimed. A 
good vocabulary test, for example, should measure g, v, a small 
error component and very little else. Educational attainments, 
especially when measured by school or other examinations are 
naturally more complex, and we have already seen that a some- 
what ill-defined factor of mdustriousness + interest, which 
Alexander calls X, plays a prominent part. Similar factors, vari- 
ously called interest, study, or ‘halo’ have been reported in 
American investigations by Holzinger and Swineford (1939), Sisk 
(1940), Carroll (1943) and Comrey (1949). Because of this, selec- 
tion for secondary or other higher education by means of g^ v, n or 
other psychological tests alone is usually less successful than 
selection which also takes account of previous school work (cf. 
McClelland, 1942). At one time some psychologists did propose 
that children most likely to benefit from advanced schooling would 
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be those with the highest innate intelligence, rather than with the 
best attainments, but we realize now that this was short-sighted. 

Although interesting attempts have been made to measure or 
assess personality factors relevant to scholastic success, it is 
doubtful whether any are practically applicable on a large scale. 
When teachers' judgments are studied, some are found to give 
excellent predictions, but others are less competent than objective 
ability tests. The overall result (if we may accept McClelland's 
findings) is that such judgments add nothing worthwhile to 
estimates based on tests plus school marks, since anything of value 
which the average teacher knows about the industry, etc., of his or 
her pupils is already embodied in the marks. At the moment, 
therefore, we know little about X, though further research would 
certainly be profitable. In particular we would like to know how 
far it depends on: 

(a) home background; (b) the ‘tone' of the pupil's school; 

(c) the stimulatingness of, or good teaching by, his teacher; 

(d) the pupil's interests; (e) his temperamental characteristics. 

Overlap of School Examinations and Psychological Tests. 

That psychological tests and school marks often measure rather 
different things is shown not only by the imperfections of objective 
tests in picking good and poor scholars, and by Alexander's work, 
but by such investigations as the following. Bradford (1946) 
presents data for 105 technical school boys on five varied subjects 
and nine paper-and-pencil or performance tests. There is a 
general factor with 24 per cent, variance and a bipolar with 
16 per cent, separating all the school marks from all the tests. 
Another bipolar, distinguishing the more technical subjects and 
performance tests from the more linguistic subjects and tests has a 
variance of only 4 per cent, Drew’s results (cf. p. Ill) are similar. 
Blackwell (1940) compared mathematical achievements of 100 
secondary school boys and 100 girls with scores on spatial and 
verbal tests specially designed to measure the reasoning processes 
believed to be involved in mathematics. By rotation of axes she 
arrived at factors in which all types of measure are represented, 
but a study of her unrotated factors suggests that here, too, the 
mathematical marks, the spatial and the verbal tests, fall into 
relatively distinct clusters. However, it is probable that the con- 
trast is most marked in highly selected groups. For m a research 
by Kerr (1942) with 527 relatively heterogeneous twelve-year 
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pupils, five school subjects and four tests yielded no such factor 
as Alexander’s or Bradford’s. Instead a clerical test fell in a cluster 
with English and Language marks, and an intelhgence test in the 
Maths- Science cluster. Art marks and spatial-mechanical tests pro- 
vided additional group factors of their own. Comrey (1949) also re- 
ports complex overlapping between test factors and courses of study 
among West Point cadets. It should be noted that only when re- 
sults such as these are obtamed is differential diagnosis, i.e. the 
prediction of suitability for different types of school course, possible. 

Unitariness of We would expect attainment measures to 
correlate highly with one another because of their common g and 
X content. But there seems to be a common v:ed ability in addi- 
tion, since the isolation of distinctive sub-factors for different sub- 
jects is remarkably difficult among representative groups of adults 
and children. For example in the investigation of Army recruits 
in Table V, the correlation between spelling and dictation tests 
was no higher than the correlation of either with the verbal ability 
Test 25. (Descriptions of this and other tests used in the Services 
may be found in Vernon, 1947b, or Vernon and Parry, 1949.) 
Apparently spelling ability is little if at all differentiated among 
average adults from the fluency + vocabulary ability on which this 
test probably depends, although according to Thurstone (1948) 
spelling is a highly distinctive factor among college adults. 
Though verbal and numerical abilities are usually separable, as in 
Burt’s original research and in Table V, they tend to show a good 
deal of overlap, over and above g. Thus in Schiller’s (1934) in- 
vestigation of twelve tests among 395 pupils aged around nine 
years, the correlation between Arithmetic Reasoning and Com- 
putation was no higher than the correlations of both tests with four 
reading tests and tests of verbal^. 

Sub-divisions of v:ed, V :ed breaks down more readily into 
specialized abilities in populations that are more homogeneous in 
educational level, such as technical schoolboys or college students. 
Kerr’s (1942) research, already mentioned, gave a factor differ- 
entiating linguistic from mathematical-scientific subjects, though 
its variance was far smaller than that of the general factor Again 
Wilson (1933) analysed three sets of School Certificate marks and 
found, in addition to a general factor (presumably a mixture of 
g + X + v:ed), group factors for Arithmetic- Algebra- Geometry, 
for French-English and History-English, for Art-Needlework and 
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Art-Handicraft. The mean general and group factor loadings both 
approximated to *53, i.e. about 28 per cent, variance. 

It might be supposed that group factors tend to become more 
prominent relative to general educational ability with older pupils. 
Yet Wolf (1939), when trying to develop aptitude tests for different 
types of courses at Yale University, found almost as high correla- 
tions (averaging -45) between first-year examinations in arts and 
science subjects as between different arts or diiBferent science sub- 
jects (average • 59). Among postgraduate student teachers Vernon 
(1939) still found the general educational component predominant. 
His correlations can be most simply analysed into 2 Lv:ed factor with 
26 per cent variance, and separate group factors for science sub- 
jects — ^Psychology, Arithmetic, Hygiene, Nature Study — and 
practical subjects — Speech Training, Teaching Skill, Physical 
Training — ^with combined variance 12 per cent. The remaining 
subjects — Education, Geography, English and History, depended 
entirely on the general factor. Among Army engineering cadets, 
nine tests of attainment in different branches of mathematics and 
physics and two intelligence tests were analysed. The marks were 
found to depend only to 5*3 per cent, on^, but to 49*3 per cent, 
on a mathematics-physics educational factor. Additional group 
factors, covering 18*5 per cent., involved: 

^ (1) Lower maths — ^Arithmetic and Algebra. 

(2) Higher maths — ^Trigonometry, Calculus, Co-ordinate Geo- 

metry. 

(3) Physics — Mechanics, Heat, Light, Electricity. 

The unitarmess of v:ed is illustrated too by the success with 
which verbal or mathematical tests predicted ability among Ser- 
vice recruits at almost all jobs involving theory or bookwork, ex- 
cept the most highly specialized such as radio mechanics (cf. Vernon 
and Parry, 1949^ Similarly recruits who had held clerical jobs as 
civilians were usually excellent at verbal or mathematical jobs such 
as telegraphist, electrical mechanic, etc. No doubt there is a limit 
to this; thus a man who made a good interpreter would not 
necessarily be equally capable at the higher maths and physics 
needed by an electrical mechanic 

Rote vs. Reasoning Attainments. We have seen that a 
classification of attainments into linguistic and mathematical- 
scientific emerges fairly readily, together with further divisions at 
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more advanced educational levels. Another type of classification 
has been suggested, namely into attainments involving rote know- 
ledge, including spelling and mechanical arithmetic, and attain- 
ments involving more reasoning such as reading comprehension, 
composition, and mathematics. Burt first made this distinction 
among linguistic subjects, but failed to confirm it in his later in- 
vestigation of a larger group of ten-year-olds (p, 15). Sutherland 
(1941), working with 134 eleven-year-olds, found, in addition to 
g, V and n, a small group factor in spelling, mechanical arithmetic 
and a number series test, which he tentatively labelled a memory 
factor; also an ‘induction’ factor in problem arithmetic and number 
series. Conceivably this represents a distinction between the 
stages of schooling, the ‘rote’ subjects being those that are studied 
earliest in the school career. Possibly also there is a link with 
Thurstone’s W (word fluency) and V (verbal reasoning) factors. 
But it seems equally likely that the distinction arises merely from 
the higher ^-content of the ‘reasoning’ subjects. At least there is 
no conclusive evidence so far against this explanation. 

It follows that V and n are most readily isolated by rather ele- 
mentary tests. Thus Thurstone (1938a), Coombs (1941), Guilford 
and Lacey (1947) and others regard computation ability as most 
representative of their N factor. Similarly Vernon (1949b) found a 
mechanical reading and a spelling test to be more less g- 
saturated than two silent reading comprehension tests among 
15-year-olds. In the Services rote arithmetic (the first part of 
naval or Army Test 3) and spellmg or dictation were always much 
more strongly opposed to k:m tests than was mathematics (the 
second part of Test 3). In any moderately selected group the 
correlations were apt to sink to zero or negative values. It follows 
also that, in primary schools, children are much more likely to 
show unevenness or specific backwardness in elementary arith- 
metical or verbal attainments than they are in problem arithmetic 
and composition, or other higher, more ^-saturated, subjects. 

Arithmetical-Mathematical Ability. Let us turn now to 
possible divisions within the mathematics or the English fields. 
Most factorial studies tend to show much less differentiation than 
educationists are apt to suppose. In spite of the different g~ 
saturations of mechanical arithmetic and mathematics among re- 
cruits (approximately -55 and -77, respectively), the correlations 
between these tests were always so high that a broad arithmetic- 
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mathematics group factor appeared unavoidable. Oldham (1937~-8) 
claimed that Arithmetic, Algebra and Geometry yielded separate 
group factors among secondary and central school pupils, but it is 
difficult to see how her figures substantiate this. Although she 
used tests specially designed to avoid overlapping between the 
subjects, she obtained a common factor covering, on the average, 
57 per cent, of variance, very little of which was attributable to g 
(cf. also Wilson’s results, p. 39) More detailed testing could no 
doubt transform the ^-factors for Algebra, Geometry, etc., into 
group factors, but they would still be quite small. In other words, 
pupils with special flairs for, or special disabilities at, particular 
branches of mathematics are rather rare. What Oldham’s figures 
did demonstrate, however, was the great amount of variation in 
correlations in different school classes, which could be ascribed to 
the way the subjects were taught, and to the interconnections that 
teachers had established in their pupils’ minds. Effects of teaching 
on the structure of educational abilities was shown too in a study 
of 500 naval Air Mechanics, who took an ordinary school mathe- 
matics examination on entry to a training establishment, and, a 
few months later, an exactly parallel ‘progress’ test. They were 
coached in similar problems throughout the period. Both exami- 
nations were factorized along with naval tests, and the entry one 
resembled the mathematics part of Test 3 in its loadings, the 
progress one resembled the arithmetic part. The coaching had 
transformed the ability from the ‘reasoning’ to the ‘rote’ type. 

An interesting point established in Sutherland’s (1941) re- 
search, mentioned above, was that the familiar or unfamiliar set- 
ting of arithmetic problems does not affect their factor content. 
According to his rotations, tests involving familiar or unfamiliar 
situations obtain g, v (i.e general educational) and n loadings all 
close to *5, together with small loadings (*3) on his tentative 
‘induction’ factor. No investigations, to the writer’s knowledge, 
have shown any differentiation between mental and written 
arithmetic, or between money or other types of sums. 

Another research which attempted to define the essence of 
N was that of Coombs (1941), who analysed thirty-four tests 
among 223 high school pupils. He included several tests based on 
letters of the alphabet or shapes, which were designed to measure 
the same kind of functions as arithmetic tests. Actually their N 
loadings were all close to zero, showmg that the ability is specific- 
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ally related to numbers. Nevertheless, they did bear out the 
hypothesis that N involves the apphcation of a set of highly stereo- 
typed and practised rules. Tests based on shapes, i.e. relatively 
unfamiliar symbols, gave even lower saturations than those based 
on letters. Another conception, that of serial response, did not 
seem to be crucial; for it was actually the simplest two, three and 
four digit addition tests which had the highest loadings of all 
However, Guilford and Lacey’s work indicates that, in high-grade 
groups such as aircraft pilots, the tests which give the purest 
measures of N must not be too mechanical For they report higher 
saturations for subtraction and division than for addition and 
multiplication sums. This is borne out by the experience of 
British Army psychologists. 

Reading Ability or Abilities. As in arithmetic, so in English 
there are no definitely established classifications of sub-types of 
ability. We talk of literacy and illiteracy as though they constitute 
a general factor in all verbal subjects, and very likely we are correct. 
But we do not even know how far the reading and writing com- 
ponents of literacy are distinctive, nor whether creative com- 
position, knowledge of granamar, sentence structure and punctua- 
tion, or spelling are separable components of writing (cf Vernon, 
1949b). Harris (1948) lists correlations between fifteen readmg, 
writing and English measures in four groups of about fifty 
American Indian pupils. But they are far too irregular for any 
definite factors to emerge other than a strong g + v:ed one 
throughout, and possibly a contrast between the reading tests on 
the one hand and the usage and composition tests on the other. 

In the field of reading many different batteries of tests have been 
pubhshed. Each author analyses the total complex of readmg 
skills into different a prion components, for none of which is any 
empirical justification offered. For example, Burt and Schonell 
provide tests of word pronunciation, continuous prose, speed and 
comprehension. Gates’s tests for Grades 3 to 8 claim to measure 
Reading to Appreciate General Significance, to Understand 
Precise Directions, to Note Details, etc. Trigg’s series for Grades 
7 to 12 includes Vocabulary, Visual and Auditory Comprehension, 
Rate of Reading three types of material, and two tests of Word 
Attack. Hall and Robinson (1945) state that the correlations be- 
tween Gates’s tests, which are supposed to measure distinct skills, 
are as high as those between tests by different authors which are 
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supposed to measure the same skill. When moderate or low 
correlations are found among tests aimed at different aspects of 
reading, this is often due merely to the unreliability of the tests, 
and not to any real distinction between those aspects. 

Mechanical Reading. The existence of only moderate cor- 
relations between mechanical (e.g. word-pronouncing) and silent 
reading or comprehension tests is fairly well established, but we 
have already suggested that this may be due mainly to the higher 
^-content of the latter. Vernon (1938) correlated several reading 
tests and teachers’ marks among Scottish primary school pupils, 
and found his Graded Word test to have a higher reading factor 
saturation than either a comprehension or a speed test. Within 
the field of mechanical tests, there is some evidence that recogni- 
tion tests differ from pronouncing ones. Thus Dunlop (1942) 
reports a correlation of *83 between Vernon Word Recognition 
and McLaren Word and Picture Matching tests, among 6-year 
children, but correlations of *64 and *67 between these and the 
Burt-Vernon Graded Word tests There is as yet no justification 
for subdividing oral reading into speed vs. accuracy factors, or 
into ability with regular phonic and irregular words, or into 
pronunciation of isolated words vs. complete sentences, though 
these are all possibilities. 

Silent Reading. American investigators seldom use mechanical 
(individual) tests, but are more interested in differentiation among 
silent reading tests There is fairly strong evidence for partially 
distinct speed of reading, vocabulary or word knowledge and 
comprehension of sentences and paragraphs, factors at the high 
school and college levels, though at the same time there is a strong 
general factor. Gates (1921) quotes correlations for several groups 
of children (eight to fourteen year) in a single school for four 
reading comprehension tests, three rate tests, one oral, two voca- 
bulary, and a group intelligence and a directions, tests. The 
averaged figures, shown in Table VI, indicate that all types 
measure much the same thing at this level. Thus different 
comprehension tests correlate more highly with rate measures 
(from other tests) and with intelligence than they do with one 
another. However, both rate and vocabulary show some specific 
overlap, i.e. they constitute partially distinct group factors. And 
another test (omitted here) based on reproduction of material read, 
gave very low correlations with the rest, 
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Hall and Robinson (1945) claim to have separated speed, 
vocabulary and accuracy in a study of 100 college students, to- 
gether with an independent factor of ability to read and under- 
stand charts and tables. And they criticize reading tests which 
commonly involve a mixture of factors. Langsam (1941) similarly 
factorized twenty-one tests among 100 17-year students and ob- 
tained five factors which she identifies, not very convmcingly, with 
Thurs tone’s V,P, W,NzxidL The perceptual factor covered most 
of the speed of reading tests, W the vocabulary tests, and I the 
tests involving logical organization and selection of ideas. Her 
first, general V, factor had about twice the variance of all the other 
reading factors combined. 


TABLE VI AVERAGED CORRELATIONS BETWEEN DIFFERENT 
TYPES OF READING AND INTELLIGENCE TESTS (GATES, 1921) 



Compre- 

hension 

Rate 

Oral 

Vocabu- 

lary 

Intelli- 

gence 

4 Comprehension tests 

( 52) 

•55 

56 

•51 

*59 

3 Rate of Readmg tests 

‘55 

(•59) 

•52 

*47 

•50 

1 Oral Reading test 

*56 

•52 

( ) 

•39 

•53 

2 Vocabulary tests 

•51 

•47 

•39 

( 69) 

•52 

Intelligence and Directions 
tests 

•59 

•50 

•53 

•52 

(*61) 


Davis (1944) also attempted to sub-divide reading ability and 
criticized tests which involve a mixture of factors. From a survey 
of the literature he arrived at the following a priori components: 

(1) Knowledge of word meanings. 

(2) Recognition of appropriate meanings for words in particular 

contexts. 

(3) Following the organization of a passage and identifying 

antecedents and referents in it. 

(4) Recognizing the mam thought of a passage. 

(5) Answering questions that are directly answered in a passage. 

(6) Answering questions that are only indirectly answered. 

(7) Drawing inferences from a passage about its contents. 

(8) Recognizing hterary devices used, and getting the tone or 

mood. 

(9) Determining the writer’s purpose and point of view. 
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Tests of these components were devised and given to 421 
students without time limits, in order to eliminate any speed 
factor. On factorizing Davis claimed to isolate two main inde- 
pendent factors — word knowledge (chiefly in 1) and reasoning 
(chiefly in 6 and 7), together with several smaller factors corres- 
ponding to some of the other components. However, Thurstone 
(1946) re-analysed the data and showed that all correlations could 
be well accounted for by a single general factor, a blend of word 
knowledge and comprehension. 

Other Reading Factors. Cans (1940) finds that success in 
selecting reading matter for help in solving a problem is separable 
from comprehension. Feder (1938) claims that reading factual 
material for information, and reading for inference, are distinct. 
But, in common with many other investigators, he fails to show 
that these are consistent factors in themselves. Artley (1943, 
1944) reviews researches on reading tests m different fields of 
knowledge and concludes that although they show considerable 
overlapping, there are wide variations between pupils’ or students’ 
abilities m different fields. Here, too, the unreliability of the tests 
has seldom been properly controlled. Artley himself obtained a 
correlation of -785 between general reading vocabulary and voca- 
bulaiy in a special field — Social Studies. It is noticeable also in 
Hall and Robinson’s research, which included reading tests in 
several fields such as Geology, History and Art, that none of the 
factors differentiated the tests according to subject matter. How- 
ever Greene (1941) lists correlations between vocabulary tests in 
eight fields- Human Relations, Commerce, Government, Physical 
Science, Biological Science, Mathematics, Fine Arts and Sports. 
These average only -27, whereas the average reliability of the 
separate tests is *84 Tests containing technical (e.g. mechanical) 
matter also appear to show the influence of content fairly readily. 
Thus those devised in the U.S.A.A.F. received mechanical load- 
ings of the same order as their F-loadings. Comprehension tests 
are liable to show more overlapping than vocabulary ones, and 
there is a real danger in the exclusive use of objective achievement 
tests of the comprehension (multiple-choice) type for assessing 
pupils’ or students’ knowledge. All their marks are likely to be 
greatly influenced by a general vocabulary + comprehension 
ability at such tests (cf. the discussion of test ability factors, 
p. 76f). 
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Conclusions. No further factorial evidence seems to be avail- 
able regarding the practical subjects that Burt distinguished, 
though we shall see in later chapters that the k:m factor probably 
links up with scientific ability, and that there may be an aesthetic 
discrimination factor relevant to certain subjects. Fig. 3 attempts 



to portray the main findings so far. Although the picture that it 
gives of mental structure is certainly an advance on Fig. 2, it still 
cannot hope to do justice to the complex interconnections of 
different subjects, and to their variations among groups of different 
educational levels, or taught by different methods. 

Note that g, X and other relevant personahty, interest and 
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physical factors, together with vied are placed in a central ‘com- 
plex’ which constitutes general educational ability. This affects all 
branches of all subjects. The influence of particular interests, 
traits or physical conditions on particular subjects, could not be 
shown. Vied subdivides into v and n, which branch into the vari- 
ous linguistic and mathematical-scientific subjects. Each such 
subj’ect, It may be assumed, would yield its own small group 
factor if appropriately investigated. An attempt has been made 
to place the more specific attainments, those which are usually 
least dependent on general educational ability, furthest from the 
centre; also to place furthest apart those attainments which tend 
to show the lowest correlations. 



CHAPTER V 


INTELLECTUAL FACULTIES 

Abstract. The general conclusion of this chapter is that, though 
small group factors can be isolated fairly easily in many types of 
specialized cognitive tests, no intellectual faculties beyond g and v 
are yet established as having much educational or vocational im« 
portance. W (word fluency), F (ideational fluency), I ox L (in- 
ductive or logical reasoning) and many other minor factors have 
been partially separated in selected groups, though often coalescing 
into g vm. other researches. The / group factor may be linked 
with certain personality trends. If there is a separate imaginative 
or creative ability, it seems impossible to measure it. The dis- 
covery of other abilities may well be stimulated by clinical re- 
search with mental patients, but so far there is no evidence that the 
commonly used deterioration and other clinical tests measure new 
factors. Apart from rote-memorizing and attention-to-directions 
group factors, memory and attention must also be regarded as 
unsubstantiated faculties. 

Introduction. Most modern psychological textbooks avoid 
using such terms as memory, attention, imagination, reasoning, 
etc., as though these constituted separate faculties. But we have 
seen that they still bulk largely in the lajman’s and the education- 
ist’s discussion of human mental make-up. Very commonly, for 
example, objections are raised to objective or new-type attainment 
tests because they are said to measure memory for focts, and fail 
to bring out the understanding of principles, or the capacity for 
applying knowledge, or the originality and constructiveness, 
which find expression in the old-fashioned essay examinations. 
Similarly these tests and ordinary timed verbal intelligence tests, 
applied at eleven years, are said to select the ‘slick’ type of pupil 
for grammar school education and to handicap the pupil with more 
profound intellectual qualities who would make a better scholar. 
While it is by no means proven that none of these ‘types of mind’ 
or faculties exist, the evidence summarized below shows that they 
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should not be accepted until experimental demonstration is forth- 
coming. Indeed the only thoroughly established ability of this 
kind, over and above is the ©-factor, which we will consider first. 

V factor and its Sub-divisions. Verbal ability is usually in- 
extricably bound up with education, and impregnated with our 
X-factor. For example two representative groups of Army recruits, 
totalling 1 ,570 men, took a test designed to measure v, a test of 
clerical ability and an arithmetic-mathematics test, they were also 
assessed for education received. The inter-correlations of these 
four measures were all so high and so similar, lying between -753 
and -807, that they obtained almost identical factor loadings. 
Since the educational assessment was based merely on length of 
schooling, without any allowance for goodness of schools attended 
or for education obtained by private study, it would seem that © 
is determined more by upbringing than by any special linguistic 
aptitude (cf., however, p. 32). 

A number of American investigations do throw some further 
light on its nature and on possible sub-divisions. Unfortunately 
these are less reveahng than they might be since American V 
usually embodies such a large admixture of g. Thus some writers 
regard it as essentially verbal reasoning, though the majority seem 
to agree that it is most effectively measured by straightforward 
vocabulary tests. 

In heterogeneous populations it is a broad and closely integrated 
factor. Thus Cureton (1947), who studied large groups of school 
pupils and U.S. Army clerical workers, claims that there is no 
factor in paragraph reading, verbal analogies, proverb-matching, 
mixed sentences and reasomng tests which is not covered by 
vocabulary tests. Thurstone, however, claimed independent ver- 
bal, word fluency and inductive reasoning abilities in his 1938 
research, and has since extended this division down to the six- 
year level (T. G. Thurstone, 1948). Among children he admits 
that all these factors overlap with one another, and with the N 
factor. Eysenck’s (1939) group-factor analysis of Thurstone’s data 
likewise revealed separate © and word fluency group factors, 
though Holzinger and Harman’s (1938) group-factor analysis was 
less congruent, since it grouped the chief W tests under a ‘com- 
pletion ability’ factor. 

On the other handWoodrow(1939b)foundno IT, only a single V, 
when he analysed fifty-two tests given to 110 students, although 
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several possible tests of fluency were included. Incidentally he 
also found that four of the five component tests of the George 
Washington Social Intelligence Test measure scarcely an 3 rthing 
but The Division of Occupational Analysis's (1945) hst of 
factors also omits W\ but some of its verbal tests obtained loadings 
on the perceptual and speed, as well as the V and general factors. 
The Spearman-Holzmger unitary trait study also yielded a single 
V among 13-year children, though some of the minor group factors 
such as imagination (i) or mental speed (a) may correspond to 
Thurstone's W (Holzinger, 1934-5). Johnson and Reynolds 
(1941) analysed ten verbal tests and identified the two factors that 
emerged as flow of responses, and selection of correct responses, 
which appear to correspond to W and V, Thornton (1939) men- 
tions a fluency factor in his research into tests of persistence; but 
it seems to be merely the verbal and g element in the tests he 
happened to use. 

More recently Thurstone (1948) has stated that three verbal 
factors should be distingmshed, and suggests that they are selec- 
tively affected m different types of aphasia* 

F=understandmg of verbal material. 

IF= fluency in finding words to fit a restricted context. 

jP=ideational fluency with words. 

This conclusion seems to fit in reasonably well with the several 
investigations of verbal tests listed below, though it is quite im- 
possible to reconcile them all. Since they mvolved extracting and 
rotating up to ten factors with populations of one to two hundred 
students, it would be hopeless to expect consistent results 
Fruchter (1948) re-analysed the correlations of twenty of Thur- 
stone 's original tests and found, in addition to F, W and the other 
primary factors, a ‘speed of calhng up pertiuent associations’ 
factor in Inventive Opposites, Controlled Association and Com- 
pletion tests. Taylor (1947) describes four verbal factors. 

Verbal comprehension (in Sames-Opposites, Sentence Com- 
pletion and Mixed Sentences tests); 

Verbal versatility, or ability to express an idea by several diffei • 

^ The wnter has found moderate overlapping between this battery and extra- 
version-introversion questionnaires. Possibly this connects with Catteli’s 
(1936) statement that fluency tests give good predictions of ‘surgency*. 
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ent combinations of words (in Similes, Making Sentences 
from Initial Letters, Completing Unfinished Stories); 

Word Fluency, involving no reference to the meaning of words 
(in Anagrams, Words ending with ‘tion’, Words beginning 
with S, etc.); 

Ideational Fluency or production of words through meaningful 
associations (in Writing Themes, Names of Things that are 
Round, Adjectives to Describe a House, etc.). 

Finally Carroll (1941) discovers eight verbal factors, though one 
of these may be largely motor (Maximum and Normal Speed of 
Oral Reading), and another is concerned with coherent oral ex- 
pression. He narrows down V to ability to learn and retain conven- 
tional linguistic responses (e.g. Grammar, Vocabulary, Spelling, 
Rhymes), and separates a Verbal Relations factor (Recognition of 
Roots of Words, Rearrangement of Syllables), and a third some- 
what amorphous factor. W he splits into A — speed of word 
association m a restricted context (e.g. Colour Naming, Suffixes, 
Dis-arranged Words), E — rate of production of syntactically 
coherent discourse (e.g. Number of Words in a Written Theme, 
Number of Relevant Words in Describing a Picture), and a third 
Naming factor or ability to attach appropriate names to stimuli. 

Oral Fluency. There seem to have been few factorial studies 
of speech characteristics apart from CarrolFs. Gewirtz (1948) 
applied a number of oral fluency tests to thirty-eight 5-6| 
year children, and suggests that there are different fluency 
abilities in restricted and unrestricted contexts. Most of the cor- 
relations between tests of his first type (Finding Rhymes, Words to 
Follow Tn the — ’, Children’s Names, etc.) appear to be wholly 
attributable to ^ + e; as measured by Stanford-Bmet and a Mixed 
Sentences test. But the fluencies shown in describing one’s own 
home, in giving adults’ names, and in saying as many things as 
possible, yield a distinct factor. All tests tended to correlate 
positively with behaviour ratings on Gregariousness, Competitive- 
ness, Curiosity and Originality, negatively with Social Appre- 
hensiveness and Patience. 

Fluency and Personality. Fluency has been approached from 
a different angle by a number of London psychologists — Har- 
greaves (1927), Cattell (1936), Stephenson and Studman (1934), 
etc. They have been interested in it mainly as a temperamental 
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trait, which differentiates various types of mental patients, or 
which may be related to the popular conception of extraversion, 
rather than as an ability. The tests generally used for measuring it 
include Numbers of Words Beginning With — , Names of Animals, 
Numbers of Four-letter Words, Inkblot Responses, Associations 
with Pictures, Writing Themes, Normal Speed of Reading, etc. 
Several of these use non-verbal stimuli, though the responses are 
verbal, and Taylor (1947) suggests that they are factoriaUy com- 
plex. Certainly they correlate positively, but Hargreaves doubted 
whether they embody any common factor over and above speed, 
and memory group factors. Holzmger (1934-5) included a number 
of them in the unitary trait study, and found that they resolved 
chiefly into g and v in one group of children. In other analyses he 
does claim a small distinctive imagination group factor (/). 
Another possibility which requires investigation is that W derives 
merely from the simpler and more highly drilled, less ^-saturated 
aspects of verbal ability, i*e* that IF== V—g. 

Creativeness. How about an imaginative, creative or construc- 
tive faculty? So far our evidence suggests that, apart from g and v 
there is nothing but this rather dubious fluency factor, or set of 
factors. An aesthetic discrimination or hterary taste factor is 
mentioned below (p. 93), though this too appears to be very small 
when^ and v are held constant. Knowledge of grammar, punctua- 
tion and sentence structure might constitute another partially 
distinct aspect of v. If all these were measured it is doubtful 
whether any further creative factor in the writmg of English could 
be isolated. 

Obviously this answer will not satisfy teachers, but the difficulty 
in distinguishing such a factor is that it is almost impossible to 
measure creativeness reliably. The disagreement between 
examiners when markmg the ‘higher’ (as opposed to the more 
factual) qualities of essay papers is notorious. True, the work 
summarized in Chapter VIII shows that some consensus of 
opinion as to the aesthetic merit of literary, visual or musical com- 
positions can be established, but no one has yet extended this 
method of investigation to pupils’ or students’ compositions. They 
are still marked mamly for g, v and knowledge factors. 

It is desirable here to distinguish between the pedagogical and 
psychometric viewpoints. The wnter entirely agrees that the ex- 
clusive use of objective intelligence and attainments tests for 
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selection at 11+ has had a most undesirable backwash, stimulating 
teachers to confine their instruction to the sort of problems set m 
these tests. Everything possible should be done to encourage 
creative work, including the writing of continuous prose, and to 
develop inquiring and reflective attitudes of mind. But it is a 
different matter to claim that any generally recognizable quality of 
originality exists and can be marked or measured, when it depends 
so largely on subjective taste. 

We also lack any proof that such a quality is really relevant to 
secondary school work. The writer is very willing to be converted 
by experimental evidence, but doubts whether predictions which 
take creativeness into account would be any more accurate than 
predictions based on objective tests + ordinary school marks to 
cover the X factor. 

Reasoning. Spearman himself admitted specific overlap, or a 
small group factor, in logical reasoning, though this was based only 
on two tests, given to sixty- three students, and so need not be 
taken very seriously. Thurstone^s primary factors included one 
main type of reasoning denoted as / (Induction), and two more 
uncertain factors — D (Deduction) and R (Reasoning). In younger 
groups these appeared to amalgamate, and Thurstone’s followers 
such as Davidson (1945), Taylor (1947), Fruchter (1948) and others 
have usually found only one. Now reasoning ability is one of the 
commoner definitions of intelligence, and we would therefore ex- 
pect, if we allow a g factor, that g would include the whole of the 
variance of reasoning factors, together with part of that of T, W, 
S, etc. Both Holzinger and Harman (1938) and Eysenck (1939) 
show that this »s the case, and that in only two tests out of Thur- 
stone’s fifty-six is there some specific overlap which might be 
termed logical reasoning. Holzinger does mention an additional 
Analogies group factor, but it is probably insignificant. In their 
work at the high school level, the Thurstones admit a high corre- 
lation between I and second-order g, though this was not con- 
firmed in another experiment which included a wider variety of 
induction and deduction tests (Thurstone, L L., 1940; Thurstone, 
L. L. and T. G., 1941). Among British investigators Ormiston 
(1939) claimed a logical reasoning factor in three verbal reasoning 
tests, but her method of rotation of axes was dubious. As already 
mentioned, Sutherland (1941) found a small induction factor; this 
too might have been better incorporated in g. Dempster (1948) 
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describes a bipolar factor separating ‘Reasoning’ from ‘Appraisal’ 
tests in more than one investigation, but he does not give sufficient 
information about the tests or correlations to enable us to assess its 
status. 

More convincing is the Division of Occupational Analysis’s dis- 
covery of a Logic factor in several verbal and non-verbal tests 
involving formal rational solutions to problems, in addition to their 
general factor content. But it is clear that the investigators rotated 
their factors in such a way as to minimize g. In the absence of 
published figures, we cannot tell whether a different set of rota- 
tions might not have absorbed most of this into g. Since the 
General Aptitude Test Battery issued by the Division omits this 
factor, it is apparently regarded as having no vocational importance. 
The same may well be true in the educational field. A small 
reasoning or logic group factor could be isolated from specialized 
tests, but it would be unlikely to add an 3 rthing to measures of g, v 
and n in the prediction of the reasoning ability desirable among 
secondary pupils or college students 

U.S.A.A.F. Investigations of Reasoning. The most elaborate 
work IS that of Davis and Guilford among U.S.A.A.F. candidates 
and trainees. Davis (1947) tried out fourteen tests designed to 
measure ‘practical judgment’ on 150 high school boys. Tests with 
mechanical or spatial content were, as usual, differentiated from 
verbal ones, but the latter appeared to yield separate group factors 
of; 

(1) Logical reasoning in Syllogisms, Reading or Arithmetic 

Problems. 

(2) General vocabulary. 

(3) Pure judgment and reasoning judgment. 

It is difficult to discern the difference between (1) and (3), but 
Davis suggests that the latter requires the calling to mind of 
pertinent information, in other words that Judgment involve 
something in the nature of Fluency, In default of analyses along 
with g and W tests, it seems unsafe to draw any conclusions. Davis 
stresses the complexity and the number of independent factors 
involved in reading, but this may be due merely to his use of 
Kelley’s factorial technique on highly unreliable tests. ^ Actually 

^ The Principal Axes technique inevitably turns every specific factor (includ- 
ing error vanance) into an independent common factor. 
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the average correlation of tests falling within groups (1) and (3), 
namely -249, is scarcely any higher than the average of -223 
between these groups. 

Guilford and Lacey’s (1947) list of reasoning factors includes 
the following, in addition to V, N, mechanical, spatial and other 
influences in certain tests: 

General reasoning in mathematics tests and several verbal 
judgment and non-verbal tests. 

Reasoning II in non-verbal analogies and Gottschaldt Figures. 

Reasoning III in spatial reasoning tests and decoding (cypher) 
tests. 

Planning of routes on maps, through mazes, and in electrical 
circuits. 

Judgment in practical situations, making common-sense deci- 
sions. 

They point out that none of these conforms to logical categories 
like Thurstone’s Induction and Deduction. Considerable caution 
is needed in evaluating Guilford’s factors, since many of them were 
established only in smallish groups (2 to 300) of highly selected 
aviation students, and are of doubtful reliability (cf. p. 131). It is 
noticeable that in the larger analyses with populations of several 
thousands (which admittedly included few reasoning or judgment 
tests) only a single General Reasoning factor emerged, and even 
this tended to overlap with F. Only one analysis was carried out 
on a relatively unselected group of 689 high school boys. This was 
re-analysed by the present writer, using group-factor technique, 
and it was found that a g (with some 22 per cent, variance) + 
group factors for verbal, mechanical and spatial-perceptual tests 
(totalling some 12 per cent.) gave almost as good a fit as Guilford’s 
six multiple factors — F, Mechanical, Visualization, General 
Reasoning, Reasoning II and Judgment. 

Reyburn and Taylor’s Experiment. One investigation which 
appears at first sight to provide crucial disproof of a single g and 
evidence for several distinctive intellectual faculties, is that of 
Reyburn and Taylor (1941). Ten varied intelligence tests were 
analysed among 1,497 South Africans, aged 12-18, and although 
all correlations were positive, many were so low that the 
only statistically satisfactory solution was a set of five multiple 
factors. The content of these was very puzzling, only one — -a 
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verbal factor — appearing to coincide with the findings of other 
investigators. Yet with so large a number of carefully tested sub- 
jects, we cannot afford to ignore this. The writer would suggest 
that the wide age range has distorted the coefficients, and is 
responsible for their generally low level. For example an Absurdi- 
ties test and Porteus Mazes which would normally correlate to 
about *4 in a twelve-year group, correlates here only *047, prob- 
ably because ability at Absurdities may go on mcreasing up to 18, 
whereas ability at Mazes may reach a maximum and even begin to 
decline before 18. Another disturbing condition is that about half 
the group was Enghsh-speaking, half Afrikaans. The correlation 
would be reduced if the former were superior in Absurdities, the 
latter in Mazes. 

Factors in Mental Deterioration. A case might be made out 
for a factor distinguishing what Cattell (1943) calls fluid and cry- 
stallized abilities The former is an individuars effective mtelli- 
gence at new problems, which declines with age and is reduced by 
brain injury and other pathological conditions The latter consists 
of long-established discriminatory habits which are less, or not at 
all, subject to deterioration Since, however, the most representative 
tests of the former type are those most saturated with g such as 
Matrices, Abstraction, Wechsler Similarities, etc., whereas tests 
of the latter type include Vocabulary, Information, Comprehen- 
sion and Arithmetic, there do not seem to be sufficient grounds for 
positing any new factors beyond g, v:ed and (as will be shown 
in Chapter VII) speed. Indeed, susceptibility to deterioration 
might provide us with a useful external cnterion, beyond the 
purely statistical one, as to which tests are most representative 

Neiiropsychiatric Faculties. This brings us to a thorny topic. 
What do the innumerable clinical tests employed by neurologists, 
psychiatrists and chnical psychologists measure? Neuropsychiatric 
literature is extremely prone to talk of such faculties as memory, 
concentration, conceptualization and orientation, and to state that 
one or more of these is affected by orgamc disorders, lesions or 
psychoses. Numerous unstandardized tests are employed in 
addition to scientific scales like the Wechsler-Bellevue, for example 
Serial Sevens, Bender Visual Gestalt, Vigotsky Blocks, Repeating a 
Story in Own Words, Naming Six Large Cities (cf, for example, 
Curran and Guttmann, 1945). It is argued that even though many 
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of these may not yield accurate measures of established factors, 
they provide valuable clinical insight into the patient’s disturbed 
intellectual functions. In the writer’s view (Vernon, 1949c), re- 
search with such tests should have considerable exploratory value 
in suggesting distinguishable mental capacities, but it should be 
followed up by factorial studies like that of Thurstone in the field 
of perception (p. 89). The two approaches should be comple- 
mentary. The argument that every patient is a different individual 
who cannot be measured and fitted into a neat factorial framework 
is dubious. For if it is true, then the clinician should not talk about 
conceptualization, concentration, etc., as these also differ m every 
individual If he is going to use such concepts in describing large 
numbers of patients, their distinctiveness and consistency must be 
objectively demonstrable. There can be no doubt that many such 
tests measure nothing but g and v and valueless specifics. At the 
same time the factorist may have much to learn from the clinician. 
Many syndromes such as impaired retentivity for recent events, 
the decreased hnsight’ of the paranoiac, etc,, are thoroughly estab- 
lished and should provide pointers to factors which would be worth 
measuring (cf. Hsu, 1948). 

One attempt to analyse clinical tests is that of Halstead (1945, 
1947). He claims that thirteen tests given to fifty patients yield 
four distinctive aspects of intelligence, or else (applying Holzmger’s 
group-factor technique) one general and three group factors. The 
general factor which he calls C (Central Integrative) obviously 
corresponds to g, being highest in a group intelligence test, an 
Abstraction test, Speech Discrimination, etc. But it seems doubt- 
ful whether the other factors are statistically significant. They have 
no clear psychological meaning, and Halstead fails to show that 
they are differentially affected by different psychopathological 
conditions. 

Other Analyses of Intellectual Qualities. An original view of 
intellectual qualities has been put forward by Meili (1946; cf. also 
Myers, 1947). Though using Thurstone ’s centroid technique, his 
factors are somewhat uncertain, being based mainly on individual 
tests given to several small groups of subjects (thirty to fifty) of 
of various ages. He rejects g, and is not concerned with such 
factors as F, iV, or S, which he regards as arising from the external 
characteristics of the tests. Instead he finds four main factors 
which are aspects of, or together make up, intelligence, namely: 
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(1) Plasticity: the breaking down and reorganization of struc- 

tures. This seems to resemble Thurstone^s second per- 
ceptual factor closely (cf. p. 89). 

(2) Complexity ability to realize complex intellectual structures. 

(3) Fluency. This corresponds to Thurstone’s unrestricted or 

ideational F factor. 

(4) Globalization: uniting separate data into a single whole, an 

essentially creative capacity. 

It will be interesting to see whether these can be confirmed by 
more extensive investigations. 

One other tentative approach to the analysis of intellect which 
deserves mention is that of Earle (1948). He has not used factor 
analysis, but has compared the results of numerous intelligence 
and other tests given at 10 to 13 years with the subsequent per- 
formance of the children at various types of secondary school 
course, and so arrived at a working classification. Though re- 
gardmg g as the main source of individual differences in scholastic 
abilities, he considers that the followmg sub-types or group factors 
tend to differentiate, probably under the influence of interests and 
temperament. 

(1) Knowledge of words and comprehension of sentences. 

(2) Logical reasomng, seeing relations between objects and 

ideas, recognizmg and describing attributes of persons or 
thmgs. 

(3) Seemg relations between, and carrying out operations with: 

(i) numbers, (li) shapes. 

(4) Comprehending the structure and functions of shapes, 

mechanisms and other objects; dealing with practical 
problems. 

Clearly Nos. (1) and (4) correspond to v and k *m factors. No. (3) 
links k and n. This seems doubtful at so early an age, though it 
does occur among older pupils and students (cf, p. 73). No (2) 
IS the faculty which has been criticized in this chapter. A large- 
scale analysis of Earle’s Duplex tests, which yield scores for g and 
for these types of ability, is badly needed. 

Memory. One of the most popular faculties, both among 
educationists, psychiatrists and laymen, is memory, Spearman 
(1927) admitted that g enters into many learning and reproductive 
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activities, but regarded retentivity as an entirely distinct mental 
function. Moreover, he did not expect overlap between retentivities 
for different materials, except in so far as the materials were 
closely similar. Ingham (1949) gives a valuable review of the com- 
plexities of measuring learning ability and retentivity, and con- 
cludes that Spearman was not far wrong. But g does enter into all 
memory activities, particularly when the material is meaningful, 
and a fairly broad rote memory factor can be recognized in 
addition. 

As early as 1920, Smith and McDougall found correlations of 
•S3 between two tests of logical memory, and *61 between two of 
habit memory, but coefficients around zero between tests of these 
two types. A memory for prose test partook of both types. They 
claimed thus to substantiate Bergson’s distinction between habit 
and meaningful memory. These results were obtained with 
forty-one students only and they have not been confirmed, but it 
is likely that the correlation between their logical tests represents g, 
and that the overlap between their habit tests is due to a separate 
factor. Several subsequent researches cast doubt on the existence 
of any memory factor or factors. Thus in Holzinger’s unitary trait 
study (1934“S) the memory tests resolved into g, v and a (mental 
speed). But in a later publication Holzmger (1938) demonstrated a 
group factor in immediate memory span for words, sentences, 
digits and pictures, Eysenck and Halstead (1945) applied fifteen 
of the commonly used clinical tests of ^memory’ to sixty mental 
hospital patients, and found that a single factor, identical with g, 
accounted for the whole of their overlapping. However, this result 
was at least partly due to the unusual heterogeneity of their patients 
in^. Again, Bryan (1934), working with 200 kindergarten children, 
found as close correlations between eleven memory tests and 
Stanford-Binet and Vocabulary as among the memory tests 
themselves. 

Nevertheless a large number of investigators have obtained clear 
rote memory group factors, including Thurstone, L. L. (1938a, 
1940), Thurstone, L. L. and T. G, (1941), Woodrow (1939b), 
Carroll (1941), Wittenborn (1943), Taylor (1947), etc. Most of 
these factors were based on digit or sentence span, paired associate 
or recognition tests, none of which bear much resemblance to 
learning, retention, or recall in everyday life. Kelley’s (1928) 
factor, found among 13-year, 9-year and kindergarten groups, 
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extended to verbal, numerical and visual material. But the tests for 
each type of material were exactly similar in form. Words, num- 
bers or pictures were shown to the children, who then picked out 
which items they had seen on their answer sheets. Anastasi (1930, 
1932) obtained a small but significant group factor in several paired 
associate tests and m recognition tests of previously presented 
words, syllables and forms. But as soon as she tried to extend this 
to learning and retention of logical material, delayed memory for 
words, reproduction of movements, and Seashore’s tonal memory 
test, the correlations mostly became negligible. Since she worked 
with somewhat homogeneous groups of college students, none of 
the tests except those for logical memory showed appreciable cor- 
relations with a verbal g test. In the U.S A.A.F. also rote memory 
factors were readily estabhshed, but a delayed memory test for 
logical and visual material showed no loadings on these factors, 
only on V and Visualization (A). 

The broadest factor so far described is that of Ingham (1949), 
who gave eight paired associate tests, individually, to eighty Army 
recruits, and included nonsense and meaningful words, pictures 
and forms. Each test was scored in four ways, for immediate 
memory, for speed of learning, for retention after thirty minutes 
(given a constant amount of initial learning), and for time saved in 
relearning. A single memory factor, as well as g, ran through all 
these scores. The average variance of both factors was 12 to 
13 per cent., but^ was more prominent in learning and immediate 
memory scores, and the memory factor more important in reten- 
tion and saving scores. 

Some sub-divisions of rote memory have been discovered. Thus 
Guilford and Lacey found different factors among paired associ- 
ates tests and tests involvmg study and immediate recall of details 
on maps. The Thurstones (1941) state that the factor involved in 
tests involving temporal sequence differs from that in paired 
associates. Carlson (1937) attempted to study rote vs. logical, and 
visual vs verbal memory factors. But as his material consisted 
solely of word-recognition tests, the general factor at which 
he arrived and the sub-factors for Vocal’, Visual’ and ‘meaning’ 
tests can only be accepted as specialized rote factors. Similarly 
Brener (1940) factorized seventeen tests of memory span among 
forty students and obtained, in addition to a prominent general 
factor, some rather doubtful group factors, one of which seemed 
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to be mainly verbal, another visual or spatial. Some further work 
on sensory memory tests is described m Chapter VIII. 

Attention. Finally we must consider the evidence for a faculty 
of attention or concentration. At one time Burt (1909) and Mc- 
Queen (1917), finding that tests which demanded the greatest 
attentiveness tended to have high ^-saturations, concluded that g 
and attention are identical. Holzinger (1934-5) did discover a 
group factor {t), over and above ^ in certain tests involving listen- 
ing to the experimenter and following directions. For example, the 
tester reads ‘D 2 3’ and the subjects count mentally and write down 
F I, the second and third letters after D. He admits that it was the 
smallest of his group factors, and the most irregular in his different 
groups of subjects. Woodrow (1939b) used oral and written direc- 
tions tests, and found a distinctive factor in these and in tests of 
arithmetic, cancellation and copying figures off a chart. Tliis would 
appear to bear a close resemblance to Thurstone^s perceptual speed 
(clerical) factor, discussed later. However, Wittenborn (1943), in 
an investigation with 175 Air Corps trainees, included several 
complex oral directions tests demanding sustained concentration 
For example, lists of digits were read out (on phonograph records), 
testees having to write a cross if the first in a list was the largest, 
the second the smallest, or to make appropriate responses to odd 
or even numbers. Other tasks were based on lists of vowels and 
consonants. These tests gave a prominent factor distinct from P, 
M, S and AT. How far it involves g and v was not indicated. 

Much the same concept is implied by Guilford and Lacey’s 
work on ‘integration’. They claim not one, but three integration 
factors in tests where subjects have to learn a number of rules and 
hold these in mind while deciding the appropriate responses to 
problems. Thus the best test of the 71 factor shows aircraft carriers 
flying various flags, and the carrier from which planes should take 
off depends on the numbers of flags, the directions of the ships 
relative to the wind, and other instructions 72 factor appeared in a 
printed directions test and other tests involving quick adaptation 
to new instructions. 73 occurred in certain planning and reasoning 
tests where numerous considerations had to be integrated. It 
appeared to represent breadth rather than strength or flexibility of 
attention. While we can admire the ingenuity of the tests devised 
by U.S A.A.F. psychologists for measuring the mental traits con- 
sidered important in aircrew, and the care with which they were 
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validated, we can legitimately ask for confirmatory evidence from 
less highly selected groups before we accept these as distinctive 
and practically useful group factors. 

Many teachers appear to think that most of their troubles would 
be ended if they could develop their pupils’ attentiveness. Prob- 
ably however what they mean by attention can be resolved into: 

(a) ^ ^ or in other words the children’s mental ages. For it 

is well known that lack of concentration is most prominent in 
younger classes. 

(b) the X faptor, which includes the pupils’ interests in each 
school subject As already pomted out this is a function of the 
school or teacher as well as of the pupil, and wandering of attention 
should often be attributed to the teacher’s failure to make a subject 
sufficiently interesting rather than to the pupils’ traits. 

(c) a relatively tiny group factor such as that isolated by Hol- 
zinger, Woodrow and Wittenborn. It is conceivable that their 
tests might give valuable predictions of educability, over and 
above g and v.ed tests, but the only evidence so far is negative 
(Wittenborn & Larsen, 1944). More probably their attention-to- 
directions group factor is, like the rote memory factors, confined 
to too narrow a type of test to have much spread to other attention 
situations. 



CHAPTER VI 


VERBAL AND NON-VERBAL FACTORS IN 
INTELLIGENCE TESTS 

Abstract, All intelligence tests measure some group factor or 
factors based on their type of material, m addition to^ and specifics. 

factor is very prominent in verbal tests, in Stanford-Binet and 
Terman-Merrill, but this enhances their predictive value for most 
educational and occupational purposes. Spatial or k tests are 
nowadays distinguished from intelligence tests, but there is no 
clear dividing line, and non-verbal g tests — abstract or pictorial 
— usually show a small spatial-perceptual component. At the same 
time the group factors are seldom sufficiently marked to justify 
using intelligence tests as measures of abilities other than g. The 
Stanford-Binet, for example, does not give reliable diagnostic 
indications of verbal, numerical, memory, spatial or other abilities. 

Research findings are often contradictory since the incidence of 
spatial or perceptual factors varies with the selectivity of the sub- 
jects and their sex, the ease of the tests, and with the pre-supposi- 
tions of the factorist. The theory that k does not differentiate till 
about 14 years is not borne out. K contrasts strongly with n in 
children and dull adults, but they tend to link up at high-grade 
levels, presumably because of scientific education. The only sub- 
factor branching off from k that is well established is perceptual 
speed in matching or identifying details of shapes or pictures. This 
may link with the clerical group factor — a sub-division of v, though 
both are rather unstable. Perceptual speed is not responsible for 
the non-g content of tests like Progressive Matrices. 

V in Verbal Intelligence Tests. Most psychologists were 
content to test intelligence by means of verbal problems until the 
1930s, on the quite plausible grounds that the highest achieve- 
ments of man’s intellect are usually reached through verbal sym- 
bols and concepts. Nevertheless it was realized quite early that 
performance at such tests was influenced by linguistic develop- 
ment and education, and many investigations such as Gordon’s 
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with canal boat children proved this. Hence scales of performance 
tests, and group tests based on shapes or pictures, were con- 
structed as early as 1917 in an attempt to give the non-English 
speaking or illiterate recruit, the deaf child and other verbally 
handicapped individuals, a fair chance. Kelley showed in 1927 
that an ordmary verbal group test measures to the extent of 90 
per cent, the same thing as a summed battery of scholastic achieve- 
ment tests. This might of course be attributed to the achievement 
tests depending over-much on g, particularly if the instructions 
and the form of the objective items are unfamiliar to the pupils. 
Hence correlations with attainment as measured by ordinary 
school marks are usually somewhat lower. But at least as likely an 
explanation is that intelligence tests involve much the same 
linguistic capacities as achievement tests. A study of conventional 
intelligence test batteries shows indeed that they often include 
vocabulary and sentence completion items, that is precisely the 
same kind of material out of which silent reading tests are com- 
posed. • 

Recent research indicates that nearly half the communality of 
many group verbal mtelligence tests consists of v rather than g, 
but that some types of test are less z?-saturated than others (cf. 
Vernon, 1947b). Abstraction tests, for example, whose problems 
are based more on letters and on word forms than on meanings, 
seem to have a ^-variance of some 65-75 per cent , and v only 5 per 
cent. Number Series tests, again, have quite small w-loadings. 
Presumably the size of the group factor depends on the extent to 
which it is fostered at home or in school. Hence it is large in any 
test depending on comprehension of words and sentences. And it 
might become equally large if schools set out to tram pupils in 
answering Abstraction and Number Series problems. There can 
be little doubt that the coaching for group intelligence tests which 
is nowadays so common at 11+ has reduced the ^-variance and 
increased the group-factor variance of such tests. 

Factors in Non-Verbal Intelligence Tests. Next the problem 
arises whether similar group factors are present in non-verbal tests. 
Here there is a vast quantity of somewhat conflicting evidence, and 
it will be far from easy to pick our way through it to clear-cut con- 
clusions. As already mentioned, Kelley (1928) found a distinctive 
factor both among 13- and 9-year pupils in two tests involving 
memory for shapes and two based on turning shapes around 
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imaginally. Even among kindergarten children, though the results 
were less definite, a spatial factor appeared in a test of memory for 
shapes and a simple formboard. The Thurstones (1938a, 1941, 
1948) included numerous spatial tests in their primary mental 
abilities investigations and obtained a factor which they call 5, 
obviously the same as El Koussy’s k, even as early as 5 to 6 
years. It was most marked in tests involving imaginative mani- 
pulation of shapes. Both Eysenck and Holzinger and Harman, 
reworking Thurstone’s figures, agree that such tests measure g and 
k ox S, the average variance of both factors being in the neighbour- 
hood of 25 per cent. 

At the same time the distinction between spatial and other non- 
verbal group tests is by no means as clear as El Koussy (cf. p. 17) 
believed. Tests such as Cube Counting and Paper Formboard 
appear to involve imagination of shapes, and have obtained large 
^-loadings in many experiments. Yet they were originally de- 
signed as parts of the Army Beta test for measuring intelligence 
non-verbally,^and were included by Stephenson (1931) in the 
battery whose inter-correlations he attributed solely to^ (cf. p. 17). 
Emmett (1949) recently reanalysed El Koussy’s figures and showed 
that several visual tests, together with mechanical tests and wood- 
work marks, have almost as high ^-loadings as the original eight 
tests. Though Alexander (1935) and Drew (1947) accept Spear- 
man’s and Stephenson’s assumption that non-verbal g tests de- 
pend only on g, their results accord at least as well with the view 
that they contain a small spatial component. In numerous analyses 
in the British Services, the Progressive Matrices test and the 
National Institute of Industrial Psychology’s Group Test 70 
have obtained small k:m loadings (cf. Tables V, VII, IX). Prob- 
ably it is the spatial and not the mechanical aspect of k:m ability 
which is involved (cf. Table IX). Williams (1948), on the other 
hand, found distinctive verbal, mechamcal and spatial group 
factors in his investigation of 12 to 14 boys, but his non-verbal g 
tests were loaded only with^. 

Spatial Factors in School Subjects. Another dubious point is 
whether k enters into geometry or other school subjects. From his 
extensive work in developing intelligence tests for the selection of 
college students, Brigham (1932) was one of the first to note verbal, 
numerical and spatial group factors in such tests. And he found 
much higher correlations between spatial tests and subsequent 
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performance m drawing (descriptive geometry and mechanical 
drawing) than between spatial tests and other subjects, or between 
verbal tests and drawing. Similarly Smith (1948), who used 
several of El Koussy’s tests and obtained results analogous to his 
among 100 13 -year-olds, claims A-loadmgs in art, geometry and 
engineering drawing, though not m handwork, marks. On the 
other hand Blackwell (1940) and Holzinger and Swineford (1946) 
do not find any overlap between spatial tests and geometry, apart 
from g. The one pomt on which all workers agree is that girls or 
women are poorer than boys or men m k, and the sex difference 
has been used as a sign that a test is measuring this factor. 

Reasons for Divergent Results. No doubt different methods 
of factorization and different arbitrary rotations account for some 
of the discrepancies in the results of different investigators. The 
use of groups of subjects of differing degrees of selectivity also 
probably plays a part. It is not unlikely, for example, that k is less 
differentiated from g among duller testees. Subjects of different 
sex, age, training and general ability may often tackle the same test 
items by different methods (cf. Thurstone 1938b). Thus many of 
the Items in the Matrices test can be done largely by verbal logic, 
or by spatial imagery, and possibly even by visual matching (per- 
ceptual factor). Hence the irregular appearance of k in tests sup- 
posed to measure g is not surprising. 

Space Factor and Age. The role of the age factor has been 
stressed by Slater (1940, 1941, 1943), though again there is no 
general agreement. In a series of researches he apphed verbal, 
non-verbal g and spatial tests to 82 and 211 children aged 
11+, 161 aged 13+ and 89 trade apprentices aged about 
18. In the third group there was a clear /e-factor, which extended 
through almost all the spatial, and the mechamcal, tests. But he 
claims that, in the younger groups, the only factor besides ^ is a 
verbal one, and that spatial tests measure just the same thing as 
non-verbal g tests. He concludes that k tests cannot be used at 
11, or even at 13, for selection for technical education. It 
should be recognized however that, just as in Stephenson’s re- 
search (cf. p. 17), it would be equally legitimate to postulate a 
separate group factor in the spatial and the non-verbal g tests. 
Although he obtained a good fit with two factors there is nothing 
to stop us extracting a third. Adcock (1948) has done this, using 
both Thurstone’s multi-factor, and group-factor, techniques. He 
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finds clear v and k factors in addition to g. According to his results 
the non-verbal g tests, Matrices and Group Test 70 Parts I and 
III, have some 7 per cent, of >i-variance, whereas the definitely 
spatial tests have about 16 to 30 per cent, variance. Emmett (1949) 
similarly, after grouping together several of Slater’s tests, found 
three statistically significant factors, and some though not all of the 
non-verbal g tests certainly involved k. Another feature of Slater’s 
investigations was that his younger groups consisted half of girls, 
half boys, whereas his older group was all male Several researches, 
described later, indicate that k is less differentiated in females than 
in males. Emmett suggests also that most of the tests were not 
well suited to young children. 

The evidence for a space factor around 11-13 and earlier, from 
El Koussy ’s, Kelley ’s,Thurstone’s and other investigations is indeed 
overwhelming, Kerr (1942), Dempster (1948), and Williams (1948) 
may also be cited. Peel (1949) gave nine tests to three groups of 
70-80 boys and girls aged around 11, 12|- and 131-, and in 
each group his second, bipolar, factor contrasted two performance 
tests with three verbal tests. A spatial test based on detecting 
faults in patterns, not on manipulation of shapes, approximated to 
the performance type, and two non-verbal g tests were usually 
intermediate between the verbal and practical-spatial. Emmett 
(1949) factorized four verbal and numerical, three non-verbal and 
two spatial tests among 178 eleven- and twelve-year boys. 
The spatial tests yielded a distinct factor, one which involved 
three-dimensional judgments gaining a higher loading (and 
less g) than the other which involved two-dimensional. The 
non-verbal g tests did resemble the spatial tests more closely than 
the verbal ones, but after rotation they appeared to depend solely 
on g. He also refactorized Mellone’s (1944) figures, which had 
suggested the presence of a spatial factor in picture tests given to 
218 seven-year boys, but not among 196 girls. The same factor did 
now appear among girls, though it was smaller and rather irregular. 
Omitting two educational tests the g and k variances were 24 • 3 and 
12 T among boys, 26 • 6 and 9 • 4 among girls. 

Pictorial Intelligence Tests. Though several of Mellone’s 
picture tests were clearly spatial (e.g. Mirror Images and Cube 
Counting), and others with very low ^-loadings were non-spatial 
(Substitution and Directions), yet others which were more pictorial 
than manipulative were found to measure k as well as g. No in- 
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vestigation seems to have discovered a pictorial factor in pictorial 
group intelligence tests (though Burt suggests the existence of such 
a factor in certain performance tests; cf, p. 109). Most of the 
picture tests like Mellone's, Otis Alpha, Cattell, etc., seem to be 
rather unreliable measures of g and to possess large specific com- 
ponents, or else to evoke a small amount of A. 

Perceptual and Clerical Group Factors. Another possibility, 
which we must next consider, is the existence of a perceptual 
group factor, distinct from k — a specialized ability to solve prob- 
lems based on abstract diagrams, which might enter into such non- 
verbal^ tests as Progressive Matrices. 

A P factor of Perceptual Speed was first described by Thurstone 
in tests involving rapid visual inspection and identification of 
letters, numbers, words and shapes. In a more extensive study of 
this factor (1938b), the most saturated tests included selecting 
common word associations, classifymg words under headings 
(e.g. flowers, clothing, etc.), picking out the highest number in a 
column, and other tasks of a kind often included among clerical 
tests. Cancellation and arithmetic tests also obtained small load- 
ings. But non-verbal material of the same type seemed to be more 
saturated with S Qi) thanP. Other investigators, however, such as 
Dvorak (1947) and Guilford and Lacey have concluded that the 
factor is most prominent in tests involvmg the matching of pictures 
or shapes. In fact the Division of Occupational Analysis postu- 
lates two separate factors — Q in tests of a clerical type, simple 
arithmetic and coding, andP in matching figures or distinguishing 
slightly different pictures and shapes. A few tests suoh as com- 
paring lists of numbers partook of both P and Q, but there is as 
yet no evidence as to whether they are linked. 

In a British Army investigation, a clerical factor was discovered 
as an offshoot of v:ed. An analysis of twenty tests among 300 
clerks yielded g, k:m, v and n factors with variances 28 • 3, 3 • 3, 5 *4 
and 6*7 per cent., and a clerical factor with variance 7*7 per cent, 
in four clencal tests and in the A.T.S. spelling and arithmetic tests 
which involve the checking of right answers. Jorgensen’s (1934) 
correlations among some 150 college students likewise indicated a 
small group factor in spelling and m four clerical subtests, beyond 
g and V. But in more heterogeneous groups the factor seems to be 
absorbed into v:ed. For example, there was no sign of it in the in- 
vestigation of Table V, although three tests suitable for measuring 

F 
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it (together with two perceptual^ tests) were included. It is note- 
worthy that in the Thurstones’ researches with fourteen-year-olds, 
P appeared only irregularly and tended to merge into Word 
Fluency or Space. Again in an analysis of sixteen of Thurstone's 
Primary Mental Abilities tests by Goodman (1943), the tests 
meant to measure P resolved chiefly into V or S, 

A fundamental investigation by Thurstone into types of per- 
ception does not seem to throw any light on the nature of non- 
verbal P. It is described in Chapter VIII. But several other re- 
searches must be mentioned here. 

Other Investigations of the Perceptual Factor, Ten non- 
verbal reasoning tests were given by Blakey (1941) to 286 15-18 
year pupils and five factors extracted. There was no sign of a ^ 
factor, although several of the tests were spatial in character. The 
only clear factor besides g was one that differentiated four tests 
involving matching or identification of patterns and pictures, and a 
type of substitution test, from the rest. Probably the tests, and 
Blakey’s other factors were rather unreliable, and they may have 
been distorted by the wide age range. Thus no firm conclusion 
can be drawn here, except that non-verbal P and k are somehow 
connected. 

During the war, an investigation was made of fourteen tests 
among 500 candidates for naval radar which included, besides 
verbal tests, Matrices, Group Test 70 (all parts), an Oscilloscope 
Reading test involving rapid matching of oscilloscope pictures. 
Test 2 Mechanical and four spatial tests. The intercorrelations of 
these eight tests were almost wholly accounted for by g and k:m, 
but there was a slight contrast between the first three and the last 
five, which might mean either that Matrices, 70 and Oscilloscope 
involve less k than the others, or that they bring in a distinct P 
factor. As Matrices and Test 70 Parts II and III are given with 
fairly generous time limits (Matrices often with no limits), this 
might well obscure any factor of perceptual speed. However, Test 
70 Part I and Oscilloscope Reading are speeded tests, and these 
showed no residual correlation. 

Correlations between thirteen American tests for radar operators 
among 100 trainees are given by Lindsley (1943). An analysis by 
the writer suggests that, in addition to a prominent general factor 
and a group factor in several tests of graph readmg, there is a 
distinctive perceptual factor in the Oscilloscope Reading test men- 
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tioned above and in three tests depending on identification of 
irregxilar visual shapes. 

In another analysis of R.A.F. aircrew aptitude tests, mostly 
borrowed from the U.S A.A.F., a definite perceptual or observa- 
tional factor appeared in three tests involving matching of aerial 
photographs, of photographs and maps, and of aircraft silhouettes, 
and (to a smaller extent) in table and dial reading tests. It also 
showed some loadings for mechanical information and compre- 
hension tests and aviation information. This very wide content 
suggests that it derives mainly from aviation interests and 
A.T.C. training, rather than from any primary psychological 
aptitude. However, the same three matching tests form the basis 
of Guilford and Lacey’s perceptual speed factor, and in the 
U.S.A.A.F. it is linked, not with aviation interests, but with 
several tests usually presumed to involve k. No test of the clerical 
type had loadings of more than about -30 with this factor. 

The most relevant study is one where seventeen tests were 
analysed among 645 ground recruits — a group fairly representative 
of the general population. This included Test Obs-C — ^the 
aircrew test of matchmg silhouettes (which closely resembles the 
Division of Occupational Analysis’s P tests) — ^two tests of dial or 
scale reading, two non-verbal g tests, a clerical and three k tests. 
The results shown in Table VII do indicate the presence of a 
perceptual-clerical factor, but its variance is so small (an average 
of 3 per cent, in seven tests, whose mean commxmahty is 68 per 
cent.), that it cannot be regarded as having much statistical or 
practical significance. In a subsequent group-factor analysis of the 
same battery, not yet published, it appeared chiefly in scale and 
dial reading tests, clerical and Obs-C. Though not present in 
non-verbal g tests, it was shown to affect a few of their easiest items. 

We may conclude then that there probably is a non-verbal per- 
ceptual or observational factor in certain visual matching or identi- 
fication tests, which may be fairly prominent among selected groups 
but is very small in unselected populations. It would appear to be 
an offshoot of, or to be linked with, both the spatial or k factor and 
the clerical sub-factor of v 'ed. But it probably does not play any 
important part in non-verbal or perceptual g tests, particularly 
when these are of a power rather than a speed variety. 

Conclusions Regarding Intelligence Tests. In general, no 
test can claim to measure nothing but^ (and error variance). The 
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type of material used for expressing intellectual ability, whether 
verbal or non-verbal, always imposes some group factor, though 
this may be fairly small if the material is unfamiliar, as in verbal 
abstraction tests and visual tests of the Matrices type. Possibly 
pictorial tests are least likely to be biased either verbally or spatially 
but so far these have been constructed only for young children, 
and seem to be too unreliable to provide a promising source of g 


TABLE VII. ANALYSIS OF 17 TESTS AMONG 645 R.A.F GROUND 
RECRUITS (Rotated Centroid Factors, loadings less than *075 omitted) 


Test 

g 

n ed 

V 

Perc 

k 

Mech 

Inf 

h^ 

Calculations test 

•61 

•70 





•87 

Anthmetic test 

•62 

•67 




08 

•84 

Test 119, Scale and Graph Reading 

•88 

•33 





•87 

Test Ins-A, Dial Reading 

•80 

•32 





•74 

Test V-4, Verbal 

•67 

40 

40 

•09 



•78 

Gen-Al and SP 14, Spelling tests 

•59 

•31 

*45 




65 

Gen-A2, Reading Comprehension 

•59 

*29 

•38 

14 


•17 

•63 

SP Test 21, Clerical 

•74 

•35 

•25 

•22 



79 

Obs-C, Matching Aircraft Sil- 








houettes 

•59 



•34 

•30 


•56 

G-5, R A.F Matrices, and Pro- 








gressive Matrices I 

69 

•27 


•11 

•30 


•66 

K-6, Spatial test 

•66 

•18 



•57 


80 

SP 4, Squares spatial 

52 

•16 



*51 

•15 

57 

Group Test 80, spatial 

*68 



•09 

•44 


•66 

SP 117 E & M, Electrical and 








Mechanical Information 

•51 

•21 



•13 

•55 

•62 

Mec-C, Mechanical information 

•67 

•17 


•15 


•52 

•78 

MeC“B, Mechanical Diagrams 

•70 


•08 


•12 

42 

•69 

SP 122 Practical Problems 

•63 




•25 

•33 

*57 

Variance per cent. 

43-8 10 7 

3-5 

1*4 

6-3 

5 5 

7M 


tests for older children and adults. For most scholastic, and many 
vocational, purposes verbal group tests are far more useful than 
any non-verbal tests, because scholastic attainment is itself so 
largely a matter oi v,ed as well as g. Non-verbal tests, whether 
pictorial or abstract, may be fairer to persons whose education has 
been disturbed, but they will seldom give as good predictions of 
educability or trainabihty. Their main use should be for research 
purposes, where it is desired to separate off group factors from g. 

Stanford-Binet and Terman-Merrill Scales. Several analyses 
of these scales have been carried out by Burt (1939b), Wright 
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(1939), McNemar (1942b), Burt and John (1942) and Hammer 
(1948), These agree in showing that a gener^ factor carries most 
of the variance, but that there are numerous small group factors 
whose make-up depends on the age level, i.e the particular series 
of items, chosen. Most usually found are a verbal factor (e.g. in 
vocabulary), numerical (m digits, counting, and giving change), 
or alternatively an immediate memory factor, and a spatial- 
pictorial factor It is doubtful whether any of these are sufficiently 
clear-cut or sufficiently consistent over a wide age range to justify 
testers in using the scales diagnostically, e.g. in claiming that a 
child IS ‘good on the verbal side’, or ‘weak in memory’. So far no 
one appears to have analysed the scale among children with other 
reference tests. If this were done it would almost certainly be 
found that the general factor is partly composed of v, Thomson’s 
(1940) analysis of Stanford-Bmet and eight performance tests 
among eleven-year-olds suggests that its g variance is about 
50 per cent One might guess therefore that the rest consists of 
v'ed 25 per cent., other group factors 10 per cent., specificity and 
unreliability 15 per cent Alexander included Stanford-Bmet in his 
factorial study of adult women, and analysed it into ^ 44 per cent., 
V 27 per cent., F (practical) 4 per cent., specificity 25 per cent. The 
group-factor content certainly differs at different ages, which 
means that the test does not always measure the same thing. 
Nevertheless it seems to be as useful as, or more useful than, any 
group test among children, partly because its g and vied content 
is so high, partly perhaps because it is less affected than group 
tests by artificial ‘formal’ factors (cf. p. 76) 

The Spatial Factor and Mathematical-Scientific Ability. 
Some further work on the space factor may be appended here 
Among children and dull or average adults, k and n tests are 
strongly opposed, apart from g (cf. p. 34), But at high-grade 
levels an interesting alteration occurs (as Brigham seems to have 
realized in 1932). Both k and non-verbal g tests tend to link up 
with mathematical ability, and n becomes detached from the vied 
cluster. Probably this is due to the mfluence of science training 
both on mechanical-spatial and mathematical abilities Thus 
among Vernon’s (1939) Training College students, Stephenson’s 
non-verbal g test correlated more highly with arithmetic and 
science subj'ects, a verbal g test with education, geography and 
history. Similarly among Army engineering cadets, the corre- 
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lations of advanced mathematics and physics were greater with 
Matrices, while the more elementary arithmetic and algebra 
correlated more highly with the verbal Group Test 33. Fmally, 
among 540 candidates for the higher Civil Service, the two 
factors shown in Table VIII bring out the mam features of 
ten psychological tests, four examinations and two gradings of 
education. The first contrasts all the tests with the educational 
measures, while the second contrasts verbal tests and academic 

TABLE VIII. ROTATED CENTROID FACTORS FOR TESTS 
AND EXAMINATIONS TAKEN BY 540 CANDIDATES FOR 
THE HIGHER CIVIL SERVICE 


Test 

Tests vs. 
Acadeimc 

V vs. 
h:k 

CISSB Test 70/1 

•55 

-09 

Reading Comprehension 

•55 

•15 

Verbal Fluency 

•24 

*58 

Current Affairs 

•15 

•37 

CISSB Test 6, Verbal 

•12 

•43 

Qualifying Intelligence Test 

•47 

•40 

Qualifying Verbal 

•40 

•24 

Qualij^ng Instructions 

•35 

•00 

Qualifying Orientation 

•54 


Qualifying Cube-Counting 

•54 


Anthmetic Examination 

•57 


Examination, General Paper 

—•05 

•29 

English Precis 

— 09 

•28 

English Essay 

— 14 

•27 

Length of Education ratmg 

— 11 

•40 

Education Standard ratmg 

•01 

•47 


measures with spatial tests and arithmetic. An additional group 
factor, not shown here, linked the General Knowledge paper with 
the Current Affairs test. 

Sub-divisions of the Spatial Factor. Attempts have been 
made to sub-divide the spatial factor. Both with nine-year and 
kindergarten children, Kelley (1928) obtained a factor which 
appeared to involve memory for shapes rather than imaginative 
manipulation. Peel (1949) observes that some non-verbal tests 
possess a distinct aesthetic quality and wonders if this might prove 
relevant to the selection of craftsmen, as contrasted with techni- 
cians. He is investigating this further. The Thurstones in their 
high school study (1941) noted a small factor in several tests 
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apparently involving visual pursuit, e.g. Mazes, but they have not 
followed this up. 

At a very different level, namely, highly selected aircrew, Guil- 
ford claims three spatial relations factors, a visualization factor, a 
length-estimation factor, and perceptual speed. Visualization 
occurred mainly in mechanical comprehension tests, though also 
in some k tests such as Paper Folding, and a test based on verbal 
descriptions of painted blocks of cubes. SZ is confined to Thur- 
stone’s Hands and Flags tests. SI was found in psychomotor tests 
of reaction time and complex co-ordination, in instrument and 
dial reading, and in certain spatial group tests; and 53 was another 
rather curious mixture. In his later publications Guilford {1948ab) 
appears to identify Visuahzation with the conventional k and to 
define 51 as appreciation of spatial directions from the body (up 
down, to from, right left). But when Guilford and Zimmerman 
(1948) constructed tests for measuring these two factors separately, 
they correlated to about *5 among college students. Thus until 
much more confirmatory research is available, one would suggest 
that this plethora of spatial factors is more confusing than helpful. 

The relationship of k to mechanical abilities is discussed in 
Chapter X. An attempt to incorporate the main findings of this 
and the previous chapter in a diagram of mental structure is given 
at the end of Chapter VII. 



CHAPTER VII 


PRACTICE, DIFFICULTY, SPEED, AND OTHER 
FACTORS 

Abstract, Several unintentional factors, or extraneous condi- 
tions which influence factor content, are considered m this 
chapter. The form of the test item has not been shown to be of 
great importance, but it is possible that all objective (selective- 
response) tests embody a formal factor, which detracts from their 
educational or occupational value. 

Factor content is found to alter with practice, but no clear trend 
is discernible, unless the practice is directed towards stimulating 
some ability. The more difficult, and the easier items of any one 
test may be answered by different methods by the more and less 
able testees, or else measure different abilities. Wrong responses 
to some tests may likewise show different factor content from 
right ones; and their predictive value should be followed up 
separately. 

Speed of work can be partly distinguished under appropriate 
conditions from level, accuracy, or power, both with intelligence 
tests and (more readily) with simpler cognitive or motor tests. 
There is little evidence so far that a speed factor disturbs the 
predictive value of ordinary time-limit tests, except perhaps among 
adults of wide age range. 

Factors Deriving from the Form of the Test. We have seen 
that there is little ground for the supposition that there exist a 
number of intellectual faculties, of educational importance, which 
are not covered by objective intelligence and attainment tests. 
The discrepancies that occur between verbal intelligence or attain- 
ment tests and school performances were attributed mainly to a 
temperamental or interest factor, X, Another possibility, which 
merits investigation, is that the form of objective tests, as distinct 
from their content, introduces an extraneous group factor of ‘test 
ability^ and that this reduces the predictive value of such tests. 
Guilford (1940) considers that formal factors are unimportant, 
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citing Smith’s (1933) work, where fourteen tests were analysed 
among 186 students. Roughly one-third each were verbal, 
numerical and spatial; also about one-third were in completion 
(creative) form, one-third analogies and one-third classification. 
Using tetrad analysis, Smith concluded that any differentiation 
according to form is obscured by the stronger content factors, or 
else by unsuspected specific linkages. However, it is possible to 
apply Burt’s group-factor method to the correlations unaffected 
either by content or form, and to extract a g with some 35 per 
cent, variance. This leaves verbal, numerical and spatial content 
factors with about IS per cent, variance, and Analogies and 
Classification formal factors with about 7 per cent , but no group 
factor for the creative tests. Thus this research suggests a 'test 
ability’ factor or factors m the selective-response tests only. If 
this is confirmed, it would help to explain why the Stanford- Binet 
and Terman-Merrill scales appear to have better predictive value 
in daily life than do multiple-choice group tests. We know that 
larger practice effects occur on most group tests than on Terman- 
Merrill, and that these effects tend to spread from one test to 
another (cf thesummary in Vernon and Parry, 1949). Presumably 
testees become sophisticated to the instructions and the kinds of 
Items used in all selective-response tests of intelligence, English, 
or other attainments; and as they differ in their degree of sophisti- 
cation, this creates an artificial group factor. 

Further evidence of formal factors was obtained by the writer 
in an analysis of forty-two items of the Progressive Matrices test, 
along with twenty-one other tests which served to define the 
factorial content of the items. The communality of the item 
factors approximated 40 per cent., and of this only some 24 per 
cent, could be regarded as content factors {g, v, k, perceptual, etc.) 
Both figures are unusually low because the testees were a rather 
homogeneous group of 640 recruits. But the difference between 
them, 16 per cent., probably gives a fair estimate of the variance 
attributable to formal factors. Part of this amount represents a 
'difficulty’ factor (cf. below), introduced by the imposition of a 
time-limit, for some testees spend more time on the early items, 
some more on the later ones. The rest appears to arise from a 
common form factor in all the items and from separate formal 
group factors in each of the sets of items of which the test is com- 
posed. 
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Effects of Training and Practice. Several investigations have 
shown that factor content may be modified by training (pp. 42, 116). 
Anastasi (1936) claims that group factors can be produced or 
obliterated by experience, on the basis of a study of five tests 
among 200 children. After taking two forms of the tests, the child- 
ren were given specialized training in new methods of answering 
three of them; e g. they were shown how to employ spatial devices 
in certain reasoning tests. Some ten days later they took two more 
forms of the tests, and the inter-correlations and factor patterns 
showed marked alterations. 

The effects of practice without directed training are somewhat 
obscure. McNemar (1936) found higher correlations between five 
psychomotor tests after three of them had been practised inten- 
sively. The first factor variance rose from 29 • 8 to 37 • 5 per cent. 
There was little alteration in a second, bipolar, factor. Woodrow 
(1938) had tests of arithmetic, anagrams, cancellation, length 
estimation, drawing 'gates’, and others practised by fifty-six 
subjects for thirty-nine days, and then factorized initial, final, and 
gain scores. In another research (1939a), eighty-two subjects 
practised four tests sixty-six times, and initial and final scores were 
analysed along with several tests of known factor content (given 
once) which helped to identify the factors. Both studies showed 
marked differences between the loadings of initial and final tests. 
Woodrow concluded that not only their common factor, but also 
their specific factor content had largely altered. He failed to find 
any general improvement factor, but the gain scores tended to 
correlate positively with N and P, that is with the factors under- 
lying the operations which were practised. They also showed 
negative correlations with his Attention factor. This suggests, 
quite plausibly, that the least 'attentive’ subjects were able to gain 
most with practice. 

Heese (1942) gave an adding test and five psychomotor tests ten 
times to fifty students, and found positive but low correlations 
between the gain scores. Like Woodrow he denies a general im- 
provement factor, but the three rotated factors that he prefers 
seem to have little meaning. In another study by Greene (1943), 
four parallel forms of twelve miscellaneous tests (aiming, tapping, 
mazes, etc.) were given to 394 14-15 year boys. The first and last 
forms were analysed separately. The last forms showed more 
overlapping and stronger group factors (communalities of 49*7 
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and 59 ‘4 per cent,, respectively), and it is claimed that the factor 
patterns altered. But in the absence either of the original corre- 
lations or of the unrotated factors, it is difficult to say what, if any- 
thmg, this investigation proves. Melton (1947) presents a factor 
analysis of seven psychomotor tests among 350 aircrew candidates, 
where several trials were given and were treated as separate varia- 
bles. He claims that the factor content of the tests altered consid- 
erably even over five to ten minute periods, some factors showing 
higher and some lower loadings as the trials progressed. But in view 
of the low reliabihty of the scores for separate trials it would 
appear, to the writer at least, that there is no more alteration than 
might be expected by pure chance. 

Alterations of Factors with Diflficulty. Several writers have 
drawn attention to ‘difficulty factors’, but their significance in 
practical mental testing has not yet been worked out clearly. 
Hertzman (1936) studied the correlations between the scores on 
the easier and the more difficult halves of seven tests and, though 
he did not arrive at any factors, was able to show that the two 
halves often measured somewhat different abilities. Ferguson 
(1941) points out that if items in a test (or sub-tests in a battery) 
are homogeneous in content yet different in difficulty, a spunous 
factor or factors will be introduced, over and above the general one, 
contrasting the difficult and easy items (or tests). This assumes 
that inter-correlations are calculated by product-moment or point 
coefficients. Wherry and Gaylord (1944) show that this can be 
overcome by substituting tetrachoric correlations, and recommend 
that this technique of correlation should be generally adopted for 
factorial investigations, unless all tests yield normal score distribu- 
tions. However, this is not the whole story. Thus Guilford (1941) 
analysed correlations (using tetrachorics) between the ten sets of 
items in Seashore’s Pitch Discrimination test, and found that the 
most difficult sets (5 to 0.5 cycle discnmmations) measured quite 
different factors from the easier sets (30 to 8 cycles). One would 
have thought these sets to be highly homogeneous in content, but 
it is clear — either that the difficult ones measure a different pitch 
ability from the easy, or that the more able subjects use different 
methods of discriminating from the less able. 

An investigation by the writer of intelligence and educational 
tests at 75 per cent, and 25 per cent, difficulty levels (cf. p 30) 
yielded much the same pattern of group factors among the more 
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and less able, but showed larger group factor content, relative to 
g, among the latter. Shaefer (1940) makes the interesting sugges- 
tion that the perceptual P factor emerges only at a fairly low 
difficulty level, more difficult items of the same kind (or the same 
items answered by less able testees) presumably bring in reasoning 
or other ‘higher’ factors He does not say whether he has ex- 
perimental evidence, but he may be inferring from the different 
results of presumed P tests when given to adult students and to 
high school pupils (p. 70). 

While we shall probably continue to base our studies of test 
content chiefly on product-moment correlations between (approxi- 
mately) normally distributed scores, we should certainly not neg- 
lect the indications of these researches that content may differ 
considerably at different levels. 

Factors in Right and Wrong Answers. Another fruitful 
suggestion of Guilford’s is that right scores and wrong scores on 
the same tests should be analysed separately, unless they correlate 
with each other more highly than, say, *80 (when corrected for 
attenuation). In several U.S A.A F. studies, scores based on the 
numbers of wrongs had different factor loadings from scores for 
rights, and different validities for aircrew selection It follows that, 
with multiple choice or selective-response tests, guessing cor- 
rections should not be applied arbitrarily. The most appropriate 
weightings for rights and wrongs should be determined by follow- 
up research. In several of the A A.F tests of careful numerical 
or clerical work, a separate factor appeared, mainly in the error 
scores, which Guilford and Lacey identify as ‘Carefulness’, 
though they fail to provide any external evidence of its nature. It 
would be worth following up error scores on tests given at ll + > to 
see if they improve selection for secondary schools, and help to 
eliminate the hypothetical ‘facile-ness’ to which grammar school 
teachers object (cf. p. 49). 

Speed. By far the most important and most controversial of 
these supplementary or unintentional factors is that of speed. 
Ever since group tests were introduced complaints have been 
raised against the speed element. These have not been stilled by 
frequent demonstrations that scores on speeded tests correlate to 
about -90 or more with scores on the same tests given without 
time limits, nor by Spearman’s suggestion that different indivi- 
duals have different preferential rates of work which do not affect 
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their actual ability. Cattell (1943) concludes, from his survey of 
the hterature, that inability to do tests quickly is one of the main 
differentia between older and younger adults, and recommends 
that tests for adults of mixed age should usually be unspeeded. 

The problem is a complex one since the influence of speed varies 
not only with the age of the subjects and type of test material, but 
also with the mstructions given or the methods of recording speed 
and power, accuracy or level. As Davidson and Carroll (1945) 
point out, most time-limit tests emphasize both aspects, and 
actually measure a nuxture of the two components in varying 
proportions. 

McFarland (1928) gives a useful critical review of the early 
literature. He points out that many investigators have failed to 
measure speed and power independently, but concludes that they 
are fairly closely correlated in the performance of mental tests. 
Himmelweit (1946) also reviews the many rather contradictory 
researches on speed and accuracy, and shows that they accord 
reasonably well with the generalization that speed and accuracy 
are highly correlated among tests of complex mental functions, 
but may be negatively correlated among motor or manipulative 
tests Intermediate relationships are found for simpler mental 
tests or for mental + manipulative (e.g. mechanical) tests. The 
separation is increased when, as in many motor performances, 
subjects are aware of any errors they make One would suggest in 
addition that correlations rise when skills are highly practised. 
The learner is apt to be either quick or accurate, but the experi- 
enced worker is usually quick and accurate, or slow and inaccurate. 
Thus handwriting speed and quality always correlate positively in 
the primary school, though not highly (cf. Burt, 1917, 1939; Gates, 
1924). 

In Himmelweit's own experiments, common factors of average 
variance 35 per cent, were found among quickness scores, also 
among accuracy scores, on tests of addition, cancellation, under- 
lining words hidden in pied material, and the practical Track 
Tracer test, even when intelligence as measured by Progressive 
Matrices was held constant. In the absence of other reference 
tests It is difficult to specify the nature of these factors The 
investigation was mainly concerned with showing that dysthymic 
mental patients (anxiety cases, etc.) are slower and more accurate 
than hysteric patients. 
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Proofs of the Speed Factor in Mental Tests. Davidson and 
Carroll’s (1945) investigation, at the college student level, gave 
convincing results. The Army Alpha and other verbal and 
numerical tests were scored for the time taken to try all items once, 
and for items correct without time limit, as well as by the ordinary 
time-limit method. Nineteen measures, apart from time-Umit 
ones, were analysed and six factors were claimed. But several of 
these were inter-correlated, and it appears more justifiable 
merely to examine the second and third (bipolar) factors, before 
rotation. These clearly differentiate speed scores from level 
scores, and verbal tests from numerical tests. Their variance is 
small compared with that of the first — general — ^factor, but both 
approximate to 10 per cent. It is unfortunate that no non-verbal 
tests were given under the same conditions, but we have no reason 
to suppose that speed-level scores on these would not have been 
similarly differentiated. 

Similarly Slater (1938) gave untimed CAV (Thorndike) tests 
to 14-year children and arrived at speed scores by timing each 
correct answer and calculating the average rate of work. Some 
non-verbal g tests were also given. By tetrad analysis he demon- 
strated the presence of a ^ group factor in the verbal power scores, 
and a speed group factor in the speed scores. All the other inter- 
correlations could be accounted for by The ^-saturations of the 
speed scores were very low. An ingenious application by Tate 
(1948) of analysis of variance to the (individually-timed) speed of 
answering graded Arithmetic, Number Series, Sentence Com- 
pletion and Spatial Relations problems, showed a highly consistent 
speed factor both m the different materials, and at different 
difficulty levels. The correlations of speed with level scores were 
all around zero. 

Yet another technique of measurement has been developed by 
Furneaux (1948). Cyclic tests are used, consisting of batches of 
items of similar difficulty, each batch being more difficult than the 
preceding one. These are given with time limit, and accuracy 
scores are obtained from the percentage of correct responses in 
the batches attempted, speed scores from total items attempted. 
The correlation between speed and accuracy under these circum- 
stances is about -5. Possibly, however, these scores are more akin 
to Guilford’s measures of ‘carefulness’ than to Slater’s and Tate’s 
power and preferred-rate-of-work scores. 
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Discrepant Investigations. The high correlations between 
tests given with and without time limits do not disprove the 
existence of a speed factor, for the time-limit score is usually very 
largely a power one. Bernstein’s (1924) early study is often cited, 
where numerous tests were done by 11-13 year boys under condi- 
tions of leisure or of haste. No distmctive group factors emerged, 
and both sets correlated equally well with teachers’ assessments 
of intelligence. Moreover, the differences between scores on the 
two sets failed to correlate with (rather unreliable) estimates of the 
boys’ slowness at work. But these negative results are probably 
attributable to the fact that even the leisure’ tests were done with a 
time limit. In other words the conditions of the experiment and 
methods of measurement were inadequate to bring out the differ- 
ence that Davidson and Carroll found. Similarly in Sutherland’s 
(1934) experiments with group verbal, performance, and other tests, 
the conditions were imperfectly controlled. He partialled out (held 
constant) power scores obtained without time limit, and did find 
some residual correlations among time-hmit scores, but was unable 
to prove that these yielded a statistically sigmficant speed factor. He 
did, however, show that such a factor was likely to be more promin- 
ent in simpler cogmtive tests than in tests of higher mental functions. 

Speed in Tests Other than Intelligence Tests. It will be re- 
called that rate has been established as a partially distinct factor in 
reading, at least at the adult level (p. 45). Kelley’s (1928) analyses 
of tests given to 13-year and 9-year groups similarly postulated a 
separate speed factor in reading and arithmetic speed tests, over 
and above their general, verbal and numerical content. Whether 
this is the same as speed at intelligence tests has not been studied. 
As Sutherland and Himmelweit indicate, a distinct speed factor is 
most readily demonstrable in simple cognitive or in motor tests. 
Hargreaves (1927), Holzinger (1934-5) and Woodrow (1938) have 
used such tests as speed of writing figures or words, copying prose, 
coding (substitution), counting clusters of dots, and simple addi- 
tion sums, which show considerable overlap beyond^, Hargreaves 
found that this factor entered largely into so-called fluency tests. 
"While there is msufficient evidence to justify identifying fluency, 
in the sense of wealth of associations, with speed, it seems qmte 
possible that Thurstone’s W might turn out to be mainly the speed 
aspect of F. PresumablyP and^, the perceptual speed factors, are 
also closely related to mental speed. 
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There is no clear distinction between speed factors and the ease 
vs, difficulty factors discussed above. Thus an investigation by 
DuBois (1932) showed much higher correlations between easy 
Arithmetic, Analogies, Directions and Vocabulary tests than 
between these and power tests. This result might be interpreted 
as evidence for an ease rather than a speed group factor. Yet 
another link is suggested by M. D. Eysenck^s (1945) work with 
senile patients, where a group factor was found among tests of oral 
and writing speed and in digit span or rote memory tests. This has 
not been duplicated in more normal groups of subjects. 

One might expect cognitive and manual dexterity tests to yield 
separate factors, apart from the slight dependence of both types of 
test on g. But Holzinger showed that there was a common element 
in his mental speed tests and in tapping, dotting, writing, and 
tracing a simple maze, as well as specialized factors within each 
type. In other words there was a general speed factor which could 
be sub-divided. It is by no means clear where the dividing line 
comes, whether for example speed of copying prose is more ‘men- 
tal’ or ‘motor’. The Division of Occupational Analysis also reports 
a very general speed factor with loadings for almost all the time 
limit tests, which is however most prominent in manual tests that 
do not demand great accuracy. 

Conclusions. These findings raise extremely difficult problems 
for intelligence and other psychological testing although, in view 
of the high correlation of most time-limit with power scores, they 
do not justify the criticisms often made of tests for children. (We 
have admitted that such criticisms are more legitimate in respect 
of adult tests ) Allowing that level and speed can be distinguished 
under certain conditions, should we regard them as additional 
sub-factors, constituting as it were another dimension to our 
Fig. 2? Or should we regard level scores as the most appropriate 
measure of any general or group factor, speed being a distinctive 
factor? If so, is speed general to a great variety of tests or largely 
specific, or can it be reduced to certain other better established 
factors, W, P, M, or to ease vs, difficulty? 

In the writer’s view, the solution should depend more on 
practical considerations than on factor analysis alone, although 
further intensive investigation of practice, difficulty, carefulness, 
speed and other factors is urgently needed. For the time being we 
shall obviously continue chiefly to use time-hmit tests, for the 
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sake of administrative convenience. Provided that the limits are 
generous these should yield reasonable approximations to power 
scores for the main factors — g, v, n, k, etc. But the important thing 
is to follow up power, speed and mixed (time-limit) scores, and 
find which yield the best validities. Thus in 11+ selection, the 
grammar schools presumably require a modicum of speed-at-work 
in addition to power We ought therefore to find out the optimum 
weighting, and it might then well turn out that our present tests. 
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Fig 4 Diagram of Intellectual and Practical Factors in 
Psychological Tests 


with shghtly increased limits, would reproduce this weighting as 
accurately as a cumbersome system of separate speed and power 
measures. Similarly in the vocational field, where the speed and 
accuracy of simple cognitive, or of manipulative, operations are 
more readily measurable, follow-up will show the appropriate 
combination for any job. And when a fairly wide experience has 
been gained in this way, it will be easier to decide the best means 
of picturing speed, accuracy, and other components of human 
abilities. 
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Conclusions Regarding the Structure of the Main Factors 
in Psychological Tests. It would be more confusing than helpful 
to try to portray all the factors mentioned in this book in a single 
diagram. Hence educational and occupational abilities are shown 
separately in Figs. 3 and 7. The sensory, perceptual, aesthetic and 
other factors described in Chapter VIII are also isolated, in Fig. 5. 
Here however in Fig. 4 we can bring together the findings of this 
and the two preceding chapters, and forestall those of Chapters IX 
and X. Personality and physical factors are omitted, though their 
influence (e.g. on fluency, or on athletic abilities) should not be 
forgotten. It is not possible to show factors attributable to the 
form of the test, wrong answer factors, or alterations brought 
about by practice. But the speed factor is indicated by an enclosed 
area on the left of the diagram. Some of the links that seem to 
occur only in selected groups such as high-grade or low-grade 
subjects, are shown by dotted lines. 
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SENSATION, PERCEPTION, IMAGERY, AND 
AESTHETIC ABILITIES 

Abstract, There is little or no empirical evidence for the 
numerous types of perception, attention, imagery, reaction time, 
etc., which are so popular in German psychology. A small sensory 
discrimination factor may be recognized, whose visual branch 
includes distinctive colour vision sensitivities. The auditory 
branch (which enters only to a small extent into musical and speech 
abilities) may also be sub-divided. Numerous visual perception 
factors have been distinguished by Thurstone and others, though 
their significance in educational, vocational, or abnormal psycho- 
logy is not known. Imagery types (visual, auditory, motor) can be 
distinguished by appropriate techniques. Musical discrimination 
is well substantiated, and it may be Imked with aesthetic factors in 
the visual arts and in literature. 

Types of Perception, Reaction, etc. When the early experi- 
mental psychologists, particularly those in Germany, observed 
individual differences in people’s responses to sensory and other 
stimuli, they were apt to classify them into ‘types’. Thus muscular 
and sensory types were distinguished in simple reaction time, 
synthetic and analytic types of perception of tachistoscopic 
material, broad but shallow vs, narrow and concentrated types of 
attention, and so forth. Little or no attempt was made to deter- 
mme their consistency, e.g. whether people who were synthetic 
with one set of material were synthetic with other sets. As early 
as 1904, Spearman questioned this point. Reaction time types 
soon multiplied and were seen to represent little more than quick- 
ness vs, slowness. McQueen (1917) found that people who 
were able to ‘distribute their attention’ between one pair of tests 
were not superior in distributing it between other pairs. But 
German psychologists ignored such results, together with the 
more objective approach offered by Spearman’s techniques. Even 
in the 1920s and ’30s their work was characterized by a plethora 
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of unsubstantiated and unco-ordinated typologies (cf. Vernon’s 
summaries, 1933ab). Very few distinctive factors have emerged 
from British and American investigations m this field, though 
admittedly such investigations have been few in number, and 
much remains to be discovered. 

Sensory Factors- In his early study of forty-three schoolboys, 
Burt (1909) noted higher correlations between four tests of sensory 
discrimination (two-point threshold, lifted weights, pitch, length 
of lines) than could be attributed to g. He suggested that such 
discrimination constitutes a wide, though shallow, factor, since 
each measure of acuity has a large specific component; also that 
visual and auditory perception might yield distinct sub-factors. 
Little further work has been done, possibly because most of the 
tests have to be applied individually and are too time-consuming 
with large numbers. However, Carey (1915-16) confirmed the 
existence of an auditory group factor, but found no visual, tactile, 
or general sensory factor beyond g, Burt (1927) points out that his 
factor would not provide any justification for the popular notion 
of perceptive vs reflective types. Such types, if they exist at all, 
are likely to be more a matter of temperament and interest than of 
abilities. 

Differentiation between the senses has also been attempted by 
means of memory tests, based on the recognition of previously 
presented visual, auditory or other stimuli. Carey found a rote 
memory factor, beyond^, in visual and auditory material, but only 
very slight grouping according to sense modality. Walters (1935) 
found distinctive abilities for visual + auditory recognition, and for 
gustatory + olfactory. Tactile recognition partook of both factors. 

Factor analysis has been applied in the sensory field to colour 
sensitivities by Burt (1946), Pickford (1949) and Jones (1948); and 
to smells by M D. Eysenck (1944) and Hsu (1946). Though it 
does differentiate types of colour vision, it seems unable to decide 
between theories of the Helmholtz and Hering varieties. Other 
difficulties are discussed by Cohen (1949). Recently fourteen tests 
of visual acuity were analysed among 792 U.S. Army recruits 
(Personnel Research Section, 1948). The main factors appeared 
to be: retinal resolution, brightness discrimination, perception of 
letters, and perception of shapes. 

Imagery Factors. Carey also studied types of imagery, but 
found the current objective tests to be quite useless. He was able 
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to derive fairly reliable measures of visual and auditory imagery 
from mtrospections, even among schoolchildren, but was unable 
to prove any differentiation between these modahties, or any over- 
lap either with visual and auditory discrimination or memory 
(apart from g) A more promising method was developed by Burt 
in 1912. He got several persons to assess the vividness of their own 
images of a hundred different experiences. There was fair agree- 
ment between all the ratings, showing that certain experiences can 
be imaged more readily than others by everybody. But some sub- 
jects tended to give higher ratings to their own visual images and 
low to their auditory and motor images, whereas others assessed 
their auditory images or their motor images relatively highly. By 
inter-correlating the ratings of twelve subjects, Burt (1938) was 
thus able to show that the group factors corresponding to these 
imagery types earned 18*2 per cent, of variance, as compared with 
46 7 per cent, of vanance attributable to the general order of 
vividness. Burt (1940a) has shown also that this technique of 'cor- 
relation between persons’ reveals the same group factors as the 
more usual correlations between tests Further investigation along 
these lines would be profitable. It is possible that visual and verbal 
types might be found more fundamental, and that important links 
might be established with aesthetic, practical and intellectual 
abilities Clearly visual imagery has something to do with A. 
Another research suggesting an imagery factor is that of Ormiston 
(1939), but her tests are not fully described and no correlations are 
quoted 

Perceptual Factors. By far the most extensive investigation is 
that of Thurstone (1944), where forty-three sensory and perceptual 
tests (almost all visual) were given to 170 students, and ten factors 
identified. Chief of these was Facility and Firmness in Perceptual 
Closure, or the ability to see a relatively unorganized stimulus as a 
good configuration. Several k tests showed high loadings on this 
factor. Flexibility in Manipulating Conflicting Configurations was 
another factor present in two-hand co-ordination tests and m cer- 
tain problem-solving and reasoning tests. It sounds as though 
this was largely g Susceptibility to Optical Illusions was distmc- 
tive, also Reaction Time to light or sound, and an Oscillation factor 
in rate of fluctuation of reversible perspective figures. Two differ- 
ent Speed factors appeared, but they do not seem to be related to 
the primary ability, P. Form and colour dominance tests failed to 
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yield a consistent factor, and measures derived from the Rorschach 
Inkblot test gave a factor among themselves (perhaps in the nature 
of richness of association), but showed little overlapping with any 
other perceptual factors. This is in marked contrast to the claims 
of Oeser (1932) and of numerous German psychologists that pre- 
ference for form or for colour constitutes an important type which 
connects up with Rorschach types, with Jaensch’s eidetic types, 
and with physique and temperament (cf. Vernon 1933b). Thur- 
stone suggested that administrators, leaders, good and poor readers 
and other groups obtain distinctive profiles or patterns of scores 
on his perceptual factors. No further confirmation of this has been 
published, and in view of the negative results of much other re- 
search, we are entitled to doubt whether they have any sigmficant 
bearing on educational or vocational abilities, or on personality. 
They are more likely, perhaps, to be differentially affected in 
different types of mental illness. In a later article, Thurstone 
(1948) reports a closure factor in auditory tests, and proposes to 
investigate whether this is the same as his first visual factor. 

M. D. Vernon (1947) studied the perception of a variety of 
materials presented tachistoscopically, and showed by their inter- 
correlations that they fell into two main types. The first involved 
rapid discrimination of fine details of shapes, while the second 
depended on assimilation of shapes and comprehension of their 
meaning. She suggests a correspondence between these and 
Thurstone’s Closure and Flexibility factors respectively. The first 
also sounds very similar to non-verbal P factor. Actually however 
the second, not the first, gave moderate correlations with a spatial 
test and with AH4 Pt. 2 — a test of k and (probably) P. She 
points out that there is great specificity in perception, since the 
performance of testees depends so largely on the particular condi- 
tions of the experiment, and on the sets or ‘schemata’ by means of 
which they interpret what they see. 

One large-scale study was made during the war of a battery of 
perceptual and motor tests designed for selecting A.T.S. recruits 
for anti-aircraft work. This included visual acuity, aircraft spot- 
ting, two tests of perceptual acuity, pursuit meters, dotting, etc. 
In a group of about 400 women the overlapping was so small and 
irregular that a first factorization justifiably attributed it all to g. 
Re-analysis did however show additional factors with variance 
11*9 per cent., the ^-variance being 4 • 0 per cent. These appeared 
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to be a general sensory-motor factor (perhaps the same as Guil- 
ford’s Psychomotor Co-ordmation), and sub-factors for the 
sensory-perceptual and the co-ordination tests. The statistical 
significance of these factors was doubtful, and none of them except 
g showed any relationship to proficiency. In the U,S.A.A.F., 
Guilford and Lacey found a distinctive factor in tests involving 
estimations of lengths and distances. The Thurstones (1941) also 
note a narrow factor at 14 years in several tests involving counting 
the numbers of dots in patterns. 

Auditory Factors. Rather more work has been done m the 
auditory and musical fields. The Seashore and other tests such as 
Drake’s and the Kwalwasser-Dykema senes always gives a group 
factor beyond g, but tend to be so poor in reliability that no con- 
sistent sub-grouping is found. For example, in Drake’s (1939) 
analysis of four Seashore and four other tests among 163 boys aged 
around 13, there was a general factor with over 30 per cent, 
variance, and strong residual overlap between Pitch and Intensity, 
Pitch and Tonal Movement (Kwalwasser), Tonal Movement and 
Tonal Memory (Seashore). Manzer and Marowitz’s (1935) 
correlations between the ten Kwalwasser-Dykema tests among 
452 students suggest a musical training factor (in Pitch and 
Rhythm Imagery, Tonal Memory and Tonal Movement), and a 
sensory factor (m Time, Quality, Rhythm, Pitch and Intensity 
Discrimination, and in Tonal Memory). Several other researches 
indicate that elementary auditory capacities such as those measured 
by the Seashore tests have very little relation either to musical 
ability or to perception of speech (cf. Howells and Schoolland, 
1934). Thus among seventeen tests given by the writer to some 
seventy students, a Musical Knowledge test and total score on 
the Oregon Music tests had general musical factor loadings of 
• 84, whereas the Seashore Pitch and Rhythm tests had loadings of 
*28 and *35. The Seashore Tonal Memory test, however, had a 
general factor loading of *65, and the three Seashore teats had a 
considerable group factor of their own. 

In an extensive research, Karim (1942) gave thirty-two tests, 
mostly auditory, to 200 high school pupils, and identified eight 
factors after rotation. Though he claims that auditory abilities are 
complex and yield no general factor, in fact almost all his corre- 
lations were positive The first (unrotated) factor carried some 
15 per cent., and the combined bipolars 26 per cent, of variance. 
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We would agree that group factors are more prominent than 
general auditory ability m his very miscellaneous battery. The 
mam factors appeared to be. 

Pitch discrimination for complex and pure tones, and quality 
discrimination. 

Loudness discrimination. 

Time discrimination (overlapping with loudness, and other 
tests) 

Perception of masked and distorted speech. 

The remainder, involving auditory span, various memory tests, 
etc., were less clear-cut. 

Musical Ability. Wing (1941) has investigated a large number 
of music tests, and quotes a typical set of correlations between the 
seven best ones, which were selected to form a battery for measur- 
ing ‘Musical Age’. A general factor (40 • 8 per cent, variance) was 
strongest in the most complex test of judging goodness of phrasing 
and rather weaker in simpler judgments of good rhythm or good 
intensity. This (or rather the summed score on the whole battery) 
correlates only to about 3 with intelligence tests, and is, according 
to the author, but little affected by musical training. But m two 
experiments where reliable assessments of musical ability were 
obtainable, its validity averaged *80. Bipolar factors, with 
variances 13*4 and 3 1 per cent., indicating the presence of group 
factors, separate the tests of perception of alterations in melodies 
and chords from tests of judgment of suitable harmony, rhythm, 
etc. A much less marked distinction is indicated between ability 
to judge or perceive harmony, and ability with melody or rhythm. 

Vidor’s (1931) research into musicality is suggestive, though 
carried out on only thirty-five children and without resort to 
factorization. She claims the highest correlations (i.e. general 
musical factor saturations) for creative tests such as completing 
melodies, or making up melodies to fit rhythms. She finds no 
evidence of distinctive creative and receptive (perceptual) abilities, 
but thinks that melodic and rhythmic abilities may differentiate. 
There is no evidence for the latter in Wing’s results, and the 
present writer obtained almost identical correlations with other 
tests for the Oregon Rhythm and Melody scores. On the other 
hand the Harmony score correlations often differed, thus con- 
firming Wing’s third factor. 
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Like Wing, Vidor considers musicality to be independent of 
training, at least in children up to about 15. A closer integration 
seems likely to develop later, for in the writer’s research, measures 
of training and musical knowledge correlate very highly with 
Oregon musical judgment scores, and with any other tests based 
on musical (as distinct from auditory) material. The writer also 
has some evidence to support the popular view of a connection 
between mathematical and musical abilities, though he knows of 
no published proof. 

Aesthetic Discrimination. The correlation-between-persons 
technique has proved useful m the visual and literary arts. Burt 
(1933) collected fifty reproductions of miscellaneous paintings, and 
on gettmg these arranged m order of appreciation by artists and 
artistically naive adults and children, found a strong tendency to 
uniformity. Hence a person’s approximation to the standard order 
can be used as a measure of his artistic discrimination. Eysenck 
(1940) showed that the same factor extends from paintmgs of 
landscapes, portraits, etc., to other visual material such as pictures 
of clocks, embroidery and vases, and abstract curves and polygons, 
and even to odours. As with imagery, subsidiary type factors can 
be established, some individuals for example tending to rate for- 
mal, classical art more highly than colourful, impressionistic or 
representational art, some the reverse. These, however, might be 
regarded as attitudes or tastes, which fall outside the scope of this 
book, whereas the general factor is more akin to an aesthetic 
ability. Guilford and Holley (1949) suggest that an individual’s 
communality may be regarded as measunng his ‘objectivity’ of 
judgment, whereas his specificity shows the subjective element in 
his tastes. 

Dewar (1938) found positive though low correlations among 
children between a modification of Burt’s test, and other art 
judgment tests such as McAdory’s, Meier-Seashore’s and Bulley’s, 
and teachers’ judgments of the children’s artistic ability. This 
confirms the existence of a general factor, but suggests that in 
children discrimination is rather unreliable, being much affected 
by specific factors derived from the method of testing, the material 
used (e.g. paintings, furniture,, etc.), and types of taste. Like 
musical ability, it correlates with intelligence, but not highly. 

Similar tests consisting of literary passages were developed by 
Williams, Winter and Woods (1938), and applied to groups of 
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girls aged 11-17, totalling 256. Here a general literary discrimina- 
tion factor showed a rather high correlation with a verbal in- 
telligence test, i e. with g + In fact its variance was reduced 
from some 53 per cent, to 16 per cent, by holding intelligence 
constant. The literary factor also showed some overlap with tests 
of artistic and musical discrimination, but it is not indicated 
whether this might be accounted for by g, or whether there is a 
general aesthetic capacity for all types of art, plus more speciahzed 
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Fig. 5. Diagram of Sensory, Perceptual, Imagery and Aesthetic 
Discrimination Factors 


factors in the different arts. The former might well be true of 
children, though Eysenck’s results suggest the latter among 
relatively sophisticated adults. 

Conclusions. It is particularly difficult to express the findings 
of the present chapter diagrammatically, because of the lack of any 
comprehensive investigations which might show the relations 
between sensory, perceptual and aesthetic, and well-established 
factors. Moreover, exploration of this field has been extremely 
scrappy, and an indefinite number of other such factors will 
probably be revealed by further investigation. There is little or no 
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channelling of perceptual-imagmal abilities into a few major 
branches by cultural influences as there is in the intellectual field. 
No attempt has been made, therefore, in Fig. 5 to indicate the size 
of ^-loadings by distances from the centre, nor the amount of over- 
lapping by contiguity. Nevertheless, the diagram serves to put 
almost all the factors mentioned in this chapter into some sort of 
order. Note that no general perceptual or imagery ability (akin to 
general educational or practical ability) is claimed; also that, al- 
though all auditory and musical functions are probably linked, 
there is no proven connection between visuo-perceptual, visual 
'imagery, and visual art factors. 



CHAPTER IX 


PSYCHOMOTOR AND PHYSICAL ABILITIES 

Abstract. There is sufHcient overlapping among manual and 
sensory-motor tests, particularly in unselected groups, to justify 
the conception of a psychomotor factor over and above g and k:m, 
together with group factors for special types of performance. But 
their variance is so small that their limits and essential content 
are not known, and they may be much affected by age, sex, prac- 
tice, etc. Overlapping is greater among the more complex tests, 
also in low-grade subjects, but this may be due to their greater de- 
pendence on mental factors. 

The existence of a general physical-athletic factor is well sub- 
stantiated, together with clear-cut group factors in different types of 
athletic performance. G and k.m enter to a small extent, and there 
seems to be a linkage between gross and fine muscle co-ordinations. 

Specificity of Psychomotor Abilities. Early work on sensory- 
motor and manual tests (usefully summarized by Weiss Long and 
Pear, 1932) revealed very low correlations among such tests, and 
between them and intelligence or vocational proficiency. Since 
they usually have to be given individually, it has seldom been 
possible to test large groups, and all correlations tend to be un- 
reliable and lacking in statistical significance. At the same time 
there is not complete specificity; most of the correlations are posi- 
tive, though some may be negligible and even negative, especially 
in high-grade or restricted groups. 

In general the overlapping is greater among more complex tests 
For example, Farmer^s (1927, 1929, 1936) battery of ‘aestheto- 
kinetic’ tests, namely, Choice Reaction, Dotting Machine and 
Pursuit Meter, has consistently yielded a group factor, beyond g, 
the average inter-correlation being about *25. This battery gives 
small correlations with several trade skills, and with freedom from 
accidents. Farmer’s work suggests too that accident proneness is a 
fairly broad factor. For example, those who undergo many 
accidents m their jobs tend to do the same at home. 
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R. H. Seashore (1930, 1940) has developed and standardized a 
reliable battery of six tests, known as the Stanford Motor Skills 
Unit, namely: 

Spool Packing, speed of bimanual co-ordination. 

Koerth Pursuit Rotor, accuracy in following a movmg target at 
high speed. 

Motor Rhythm, precision in repeatmg an auditory rhythm on a 
tappmg key. 

Serial Discrimination, quickness m making discriminatory finger 
responses to number signals. 

Tapping speed, on a telegraph key. 

Speed Rotor, speed of rotatory arm wrist and finger movement 
in turmng a hand dnll. 

Though he denies the existence of a general motor ability, Sea- 
shore’s own figures and those of Walker and Adams (1934) yield a 
mean correlation of over *3, even among students. He states that 
the degree of specificity is as great among children down to six 
years as among adults. McNemar (1936) gave five tests (two from 
Seashore’s battery) to 182 jumor high school boys, and found a 
stronger general factor when some of them had been practised. 
The mean correlations were -266 before and -392 after practice. 
In low-grade groups such as the defective boys tested by Atten- 
borough and Farber (1934) and the senile adults tested by M. D. 
Eysenck (1945), the mean correlation between quite simple tests 
(pegboard, tapping, nuts and bolts, etc., among the former, and 
ergograph, steadiness and aiming among the latter) average close 
to *4; that is, they show some 40 per cent, of common variance. 

Group Factors. Tests of closely similar functions tend to yield 
higher inter-correlations, thus suggestmg that it would be more 
legitimate to talk of dexterities rather than of dexterity or of a 
general psychomotor ability. However, there is little agreement 
so far as to which group factors are the most consistent and 
distinctive. 

Earle and Gaw (1930) compared tests depending mainly on 
speed and others depending more on precision of finger, hand and 
arm movements, and found an average inter-correlation of *36 
among the former, *29 among the latter, but only * 13 between the 
two types of test. They confirm Burt and Moore’s (1912) finding 
that boys tend to be superior on straightforward speed and 
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strength tests, girls on tests that appear to involve ‘neat-fingered- 
ness’. Thus this grouping may be related to sex differences. 

Buxton (1938) gave nine steadiness, tapping and packing tests 
to seventy-six boys and obtained very low average inter-corre- 
lations. But factorial analysis suggested the presence of narrow 
group factors in closely related skills. More extensive was Sea- 
shore, Buxton and McCollom’s (1940) investigation of mneteen 
tests among fifty students, which yielded the following groups: 

Simple reaction times. 

Tapping, oscillatory movements in one plane. 

Stylus tapping, between two or more plates. 

Postural sway. 

Serial discriminatory reactions. 

Pursuit co-ordination. 

Motor rhythm. 

The grouping appeared to be functional rather than anatomical, 
for alterations in the particular muscles or senses employed reduced 
the correlations less than did changes in the patterns of behaviour, 
Dudek and Seashore (1948) quote similar results. The investiga- 
tion by Allport and Vernon (1933) of expressive movements like- 
wise indicated the existence of fairly general patterns of action in 
numerous simple tasks (drawing, walking, etc.). For example, 
there was an areal or expansive tendency, a contrast between 
centripetal and centrifugal movements, and a factor of strength 
or emphasis. These types of movement appeared to be related to 
underlying personality traits. In addition the natural or normal 
speed adopted in forty-five varied tasks was recorded, and though 
there was a slight tendency to positive inter-correlation throughout 
there were more marked group factors of verbal speed (reading, 
counting, also handwriting), drawing speed (on paper, on black- 
board, with foot), and rhythmic or motor speed (tapping or con- 
traction of various muscles). The Division of Occupational 
Analysis finds two factors among apparatus tests which are identi- 
fied as F (finger dexterity) and M (manual dexterity). The Min- 
nesota Placing and Turning tests and a new pegboard test are good 
measures of M, while F emerges in fine assembly work. In addition 
there is a hand-eye co-ordination or aiming factor in drawing and 
writing tests (cf. p. 134), and a speed factor in manual and other 
tests (cf. p. 84). The correlations on which these factors are based 
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have not been published, hence the variances cannot be assessed. 
Guilford and Lacey (1947) and Melton (1947) find a broad 
Psychomotor Co-ordination factor in a complex serial reaction 
time test, pursuit, aiming, and finger dexterity tests, and various 
smaller factors among pairs of tests. Other factors described by 
these authors (^1, 53, cf. p. 75) combined abilities at certain 
apparatus tests and at paper-and-pencil tests. 

Dependence of Psychomotor Abilities on Higher Factors. 
According to the hierarchical viewpoint, psychomotor factors 
branch off from the major k:m or practical group factor, and 
several researches do show loadings of sensory-motor tests with 
higher factors, at least in heterogeneous groups. 

Van der Lugt (1948) makes the interesting suggestion that the 
correlations of manual tests with intelligence are non-linear, being 
moderately high among very dull children, and zero or even 
negative among very bright ones. This is borne out by Atten- 
borough and Farber (1934), who obtained coefiicients averaging 
•52 between dexterity and tapping tests and Stanford-Binet and 
Otis Primary tests among eighty boys whose I.Q.s ranged from 
45 to 105 (median 70). Again M. D. Eysenck (1945) found a 
mean correlation of *26 between the Matrix test and Steadiness, 
Tapping, Dynamometer and Ergograph tests among seventy-five 
senile patients. 

Investigations in the Services. Few psychomotor tests were 
used on a large scale in the British Services. However, Table IX 
gives a group-factor analysis of thirteen tests applied to 500 naval 
recruits — a moderately, but not highly, selected group. Test 102 
involves judging the tension of stretched strings. Test 104 con- 
sists in picking up ball-bearings with tweezers, spoon and fingers. 
The others have been described elsewhere (Vernon, 1947b). The 
Table shows the two major factors and their sub-divisions into 
verbal numerical, informational, spatial, and manual dexterity 
minor factors.^ Here the g and k:m variance of the two psycho- 
motor tests amounts only to 3-1 per cent., their minor factor 
content to 7*3 per cent. 

Three apparatus tests were regularly applied m the R.A.F. — 
SMA3, a hand-eye co-ordination test, and a finger dexterity test 

^ It should be pointed out that the extraction of as many as eight factors from 
500 cases is open to criticism, and that minor factor loadings denved from only 
two tests each are mdetermmate. 
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(cf. Vernon and Parry, 1949). An analysis of these and nineteen 
other tests among 785 rather highly selected aircrew candidates 
gave an average ^-variance of 4*7 per cent., k:m or perceptual 
factor variance of 6*0 per cent., and an additional psychomotor 
factor of 16*2 per cent. 

Other Studies. Teegarden (1942) quotes correlations for large 
and heterogeneous groups of young adults between the Kent- 
Shakow Formboard (simple speed problems and complex prob- 
lems scored separately), Minnesota Spatial Relations Formboard, 


TABLE IX GROUP-FACTOR ANALYSIS OF MECHANICAL AND 
OTHER TESTS AMONG 500 ORDINARY SEAMEN 



Minnesota tests of placing blocks in holes and turnmg them over, 
and Cincinnati Pliers Dexterity (a modification of Johnson’s 
Tweezer and Pinboard test). The tests are too few to analyse, but 
they suggest a small g and a larger k -m saturation throughout, with 
an additional manual group factor in all but the complex Kent- 
Shakow, which is most prominent in the Phers test. 

Earle and Macrae (1929) included nine dexterity and speed tests 
in a research with sixty-six boys and found a mean correlation of 
only *16 with their adaptation of the Stenquist Mechanical 
Assembly test. But in another group of 125, three dexterity and 
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speed tests correlated *22 with Stenquist and gave similar 
coefficients, ranging up to -36, with the (paper-and-pencil) Form 
Relations test and Group Test Thirty-four, and with several 
standard performance tests. A somewhat discrepant result was 
obtained by Shuttleworth (1942) among 109 13 -year technical 
school boys, who took sixteen tests used in the Birmingham voca- 
tional experiments. The Ball-lifting test (SP Test 104) gave a 
correlation of -24 with his general factor but on the bipolar factor 
separating k.m from v:ed tests, neither this dexterity test nor 
Memory for Designs and Squares spatial tests, showed any 
mechanical loadmg. The selectivity of the group may perhaps be 
responsible. 

Cox’s Investigations. Perhaps the most lUummating work in 
this field is that of Cox (1928, 1934), though based chiefly on small 
groups of schoolboys and on the inconvenient (yet highly delicate) 
tetrad difference technique of factorization. In his earher experi- 
ments he intentionally elimmated any element of manual dexterity 
by using paper-and-pencil tests with pictures of mechanical 
models. He was able to establish overlapping in such tests, beyond 
g, which he attributed to m factor, that is the capacity for com- 
prehending and employing mechamcal relationships and prm- 
ciples. Later he studied tests of assembling and stripping, and 
showed the same factor to be present in these, particularly in the 
less routine assembly processes. When routme assembly and 
stripping and other dexterity tests such as pmboard and eyeboard 
were compared, there was consistent evidence of small residual 
overlappmg, with g and m held constant, which he attributed to a 
"routine manual factor’. Additional mmor group factors were 
indicated among sets of closely similarly dexterity tests. Each test 
of course, also involved a specific factor, and this specific variance 
was much greater in the dexterity tests than was their m, manual 
or other factor content. 

Minnesota Investigation. Some of the inter-correlations ob- 
tamed in the Miimesota study of mechanical ability are shown in 
Table X. Though they were not subjected to group factor or 
multiple factor analysis by Paterson and Elhot, they indicate the 
presence of a prominent general factor, presumably a mixture of 
g and k:m. Clearly there is also 2 iv:ed group factor in the Academic 
Grades and Otis I.Q., possibly entering into the Paper Formboard 
and Information tests. Packing Blocks and Card Sorting show a 

H 
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dexterity factor which overlaps into the first five mechanical and 
spatial tests, though not into the informational and interests tests. 
Wittenborn (1945) analysed these and other Minnesota figures by 
the centroid method, and his results tend to confirm these sug- 
gestions. He claims in addition a Maturational factor m dyna- 
mometer tests, height and weight; a Strength factor, mainly in 
dynamometer tests, and possibly a Steadiness factor, and Per- 
ceptual Speed. 

TABLE X. SELECTED CORRELATIONS FROM THE MINNESOTA 
INVESTIGATION OF MECHANICAL ABILITY 



1 2 

3 4 5 

6 7 8 

9 10 

11 

1. Quality of Shop 
Work 

2 Minnesota As- 
sembly Test 

•55 

•55 





3. Minnesota Spat- 

ial Relations 
Formboard 

4. Paper Formboard 

5. Stenquist Picture 

Test 

•S3 -56 
•52 -49 

•31 -40 

•63 

•39 -30 




6. Mechanical In- 

formation 

7. Mechanical In- 

terests 

8. Home Mechani- 

cal Operations 

•42 35 ' 

•64 -42 

•30 -40 

•40 -57 -34 

•46 -39 -28 

*22 -24 -19 

•47 

•35 -30 



9. Otis I Q. 

10. Academic Grades 

•21 -06 
•42 -13 

•18 -53 -18 
•26 -40 -28 

•67 *23 *10 

•54 -25 -12 

•57 


1 1 . Packing Blocks 

12. Card Sorting 

•26 -30 
•19 -13 

•34 -14 -21 
•23 -14 -24 

— 04 -12 -00 

•00 -09—12 

— 06 -12 
•05 *09 

*0 


Harrell’s Investigation. Centroid analyses with rotation of 
axes naturally obscure any g or k:m content of manual tests. Thus 
Harrell (1940) obtained the following five independent factors 
from his application of thirty-two tests (fifteen of them manual) to 
ninety-one cotton-mill operatives: 

(1) A verbal factor in Completion, Opposites and Analogies 

tests and schooling. Doubtless this is largely^. 

(2) A spatial factor in paper-and-pencil tests, including Cube 

Counting and Minnesota Formboard. 
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(3) An age and experience factor entering into some of the 

mechanical tests, but correlating with poor performance 
on some of the manual speed tests. 

(4) A manual dexterity factor especially prominent in Pinboard, 

Dotting, Assembling and Stripping Nuts and Bolts, and 
Packmg Blocks. Half of the dexterity tests gave apprecia- 
ble loadings, and few showed significant loadmgs with 
the other four factors. 

(5) A factor which Harrell suggests is perceptual. But as it has 

loadings on Routine Assembling and Stripping, Mech- 
anical Assembly and Stenquist Picture tests, one would 
have thought that it approximates more to a mechanical 
factor than to non-verbal P. 

Studies of the MacQuarrie Test. A number of investigations 
which throw some light on manual and higher general or group 
factors have been made with the MacQuarrie battery of paper-and- 
pencil tests of so-called mechanical aptitude, Bingham (1937) 
quotes correlations for employees of the Scovill Manufacturing 
Company between the battery as a whole and verbal and per- 
formance tests, which suggest, according to a group-factor analysis 
by the writer, that it has a ^-loading of at least * 6 and a smaller 
loading Jorgensen (1934) has also found it to have high ^-content. 
Harrell (1940) included it in the research just mentioned. The 
Copying, Location, Block-Counting and Pursuit tests were 
chiefly covered by his first two factors {g + k), while the Tracing, 
Tapping and Dotting tests depended on the first and fourth 
{g + manual). Goodman (1947) and Chapman (1948) factorized 
the inter-correlations among 329 radio assembly workers, and 
likewise established a spatial (or^ + k) factor, and a manual factor 
in the three latter tests. There were indications of a perceptual or 
visual inspection group factor in some of the tests. This is borne 
out by Murphy’s (1936) analysis of eighteen tests among 143 
14-year boys, which included Copying, Tracing and Tappingfrom 
MacQuarrie. Copymg fell in a cluster with his spatial tests, but 
the other two formed a distinct group factor with Substitution and 
Checking tests from Army Beta, that is with tests akin to P. 

Conclusion. Although psychomotor abilities do not seem to be 
so devoid of structure as early experiments suggested, it is never- 
theless true that they are predominantly specific. Thus the notion 
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of dexterity or handiness as a general factor in manual occupa- 
tions, which can be measured by one or two pegboard, nut and bolt 
or similar tests, should be discouraged. The diagram m Fig. 4 
attempts to portray the main results reviewed above. 

Physical Abilities. Jones and Seashore (1944) point out that 
greater generality is usually found among gross than among fine 
muscular capacities, and that there is much more justification for a 
general athletic than for a general motor factor. Several analyses 
of physical measures and athletic tests have been published in the 
Research Quarterly of the American Association for Health and 
Physical Education (Wendler, 1938; Hall and Wittenborn, 1942; 
Brace, 1946, etc.).^ These are said to reveal factors of Strength, 
Speed, Agility, etc. Such trait-names, however, are apt to mislead 
(cf. Appendix, p. 134), and it would be better to describe the 
factors in terms of the performances they cover. McCloy (1940) 
claims that strength, speed of movement and dead weight factors 
consistently emerge in such studies. He quotes correlations be- 
tween six athletic measures, four dynamometer tests, and weight, 
among 163 junior high school girls All correlations are positive 
and (except those for weight) fairly large, hence a group-factor 
analysis appears more appropriate than his rotated centroid 
analysis. Such an analysis by the writer indicates. 

(1) A general physical factor in all measures except weight, with 

some 42 per cent, variance. 

(2) A ‘strength’ factor in dynamometer tests, weight, and shot- 

putting and ball-throwing. 

(3) A group factor in two jumping events and one running, 

weight being negatively loaded. The variance of Nos. 2 

and 3 amounts to some 26 per cent. 

A similar research by Highmore (1949) carried out with male 
physical training students gave a general factor with 32*0 per cent, 
variance, and group factors with 11-7 per cent, for: 

(1) Running events. 

(2) Putting, throwing and kicking. 

(3) Standing and running, broad and high jumps. 

A study of nine athletic tests applied to 450 Army recruits was 
made during the war. When g, age, height and weight were held 

^ Other references arc listed, and reviewed, by Highmore (1949). 
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constant, a general physical factor accounted for 27.4 per cent, of 
variance, and another 16 *4 per cent was covered by group factors 
which include: 

(1) Running 100 yards, running 1 mile, walking 5 miles, Army 

Agility test, 

(2) Long jump, high jump, obstacle race. 

(3) Two types of chinning or pull-up tests. 

As m the case of psychomotor abilities, physical ones show some 
impregnation with^ and k:m, though their group-factor content is 
much larger. The mean ^-loading of the above nine measures was 
only *11 (variance 1.2 per cent.). However, in another investiga- 
tion of 578 Army recruits, thirteen tests or measures were analysed. 
After the extraction of a large g and a v:ed factor, there were small 
residual correlations, shown in Table XI. These clearly mdicate 


TABLE XI. CORRELATIONS BETWEEN MECHANICAL AND 
PHYSICAL TESTS AFTER EXTRACTION OF G AND V:ED 


Test 

g 

loading 

Squ. Mech. Ass 

Med. 

Cat Youth Agihty 

4 Squares Spatial 

•67 



2 Bennett Mechanical 

73 

•12 


8 Mechanical Assembly 

36 

25 *24 


Medical Category 

•34 

06 *03 -16 ' 


Youth (age reversed) 

•24 

•03 *01 *07 

•20 

16 Agihty 

•31 

—01 *04 *02 

•20 *14 

10 Morse Aptitude 

•58 

— 02 -00 —04 

•03 -16 -06 


mechanical and physical group factors The latter extends into the 
Morse Aptitude test and suggests a link between physical and 
auditory abilities In addition most of the physical measures corre- 
late shghtly with the spatial-mechanical, particularly with the 
manipulative assembly test. The same figures have been analysed 
by Banks (1949), with similar results. Overlapping between 
physical and mechanical abilities is shown also by the discovery 
(cf. Vernon and Parry, 1949) that the Army Assembly test is usually 
more valid in predicting proficiency at jobs requiring physical 
effort, in Infantry, R.A.C. and R.E., than in predicting mechanical 
skill and trainability. 

We would expect to find a closer association between all these 
factors among low-grade adults or children than among high-grade. 
No investigation of physical tests along with manual, mechamcal 
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and intelligence tests appears to have been carried out with feeble- 
minded subjects, but a study of African recruits is relevant. 
Thirteen tests given to between 308 and 631 men are listed in 
Table XIL Most of these are simplified adaptations of British 
ones; the Fourth Corner is a performance test along the lines of 
individual Matrices. Three factors were extracted, but the third 
failed to yield any logical grouping, probably because of the varying 
numbers of cases, and the two listed in the Table gave a good fit. 
The first, general, factor hardly represents g, in the sense of 
educing relations, since it gives such large loadings to the simple 
formboards and dexterity tests. Probably it corresponds to a 
general adaptability to the unfamiliar testing situation. Mechanical 
Comprehension gets the poorest loading, perhaps because it is the 

TABLE XII. FACTORIZATION OF TESTS GIVEN TO AFRICAN 

RECRUITS 


Test 

Unrotated 

I 

Factors 

II 

Arithmetic 

•57 

•34 

Progressive Matrices (revised) 

*66 

•29 

Fourth Comer Test 

•66 

•23 

Block Design 

•68 

•18 

Cube Construction 

•68 

•08 

Mechanical Comprehension 

•27 

•07 

Formboard, circular insets 

•69 

•02 

Formboard, square insets 

•56 

—•03 

Mechanical assembly 

•77 

— 07 

Screwboard Dextenty 

•64 

— 20 

Reversible Blocks Dexterity 

•56 

— 21 

Pegboard Dexterity 

*41 

— 30 

Agility 

•40 

— 42 


most un-African and the least reliable test. The second factor 
clearly divides the tests into the primarily cognitive, including 
verbal, pictorial and performance, and the primarily manipulative 
and physical. The Formboards partake of both group factors and 
so show the lowest loadings. The cognitive factor is quite close to 
British g, while the opposite pole is a mixture of our manual 
dexterity and physical factors. This study raises several interesting 
speculations. Should we regard k:m, not as a ‘descendant’ of g, 
but as arising in the first place from man’s physical abilities which 
later link up with his mental abilities? Or is there an important 
racial, or cultural, difference? Further comparative investigations 
would be very fruitful. 





CHAPTER X 


PERFORMANCE TESTS AND MECHANICAL 
ABILITIES 

Abstract. Investigations of performance and mechanical tests 
tend to yield very diverse results owing to the effects of back- 
ground, age, training, etc Much work has been done in the 
Services, as well as among children. This shows that g usually 
plays a large part, except in the more unreliable performance tests 
and m mechanical assembly tests. In general, performance tests 
measure the same ^ ^ factors as paper-and-pencil spatial tests, 

while mechanical tests measure these factors and a mechanical 
information or experience group factor. Thus, apart from their 
greater attractiveness, practical or manipulative tests show little 
or no advantage over paper-and-pencil tests. Additional minor 
group factors are indicated among special types of performance 
tests, and within the field of mechanical information 

What Do Performance Tests Measure? Tests such as Form- 
boards, Picture Completion, Porteus Mazes, Kohs Blocks, and 
Cube Construction have been widely used for some thirty years in 
clinics and in vocational guidance (cf. Gaw, 1925), to measure a 
more ‘practical’ type of intelligence than the mainly verbal Binet 
or group tests. It has often been assumed that they give some 
indication of aptitude for mechanical or manual jobs, or for tech- 
nical as contrasted with academic education. McFarlane (1925) 
found some overlapping, beyond that due to g, among boys but 
not girls in a number of practical tests including Cube Construc- 
tion, the assembly of a wheelbarrow out of wooden pieces, and 
Healy’s puzzle box where the testee has to puzzle out an elaborate 
system of latches and fastenings to open the box. Spearman, how- 
ever, considered that performance tests are merely rather unreli- 
able g tests, and Cattell (1936) reiterated this view. Kohs (1923) 
put forward his Block Design test purely as a measure of general 
intelhgence. El Koussy’s (1935) survey of the literature (up to but 
not including Alexander’s work) also showed that the weight of 
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evidence was against any single or clear-cut practical factor in all 
performance tests, at least among children. Neither Pintner- 
Paterson’s, Gawd's, Drever-Collms’s, nor Arthur’s standard batter- 
ies of tests appear to have been analysed thoroughly, presumably 
because of the time taken to test sufficient numbers. Kelley (1928) 
quotes correlations for small groups of 13 -year boys and girls 
between eight of Gaw’s tests, but as he chose the ones showing 
most independence it is not surprising that they failed to yield any 
logical pattern. Schiller (1934) included three of the poorer Pint- 
ner-Paterson tests, also Drawing a Man and the non-verbal Army 
Beta and Otis Primary group tests in an investigation of twelve 
tests among 395 nine-year boys and girls. A group-factor analysis 
by the writer indicates that the Pintner-Paterson total scores and 
Drawing a Man have ^-variance of about 20 per cent, and k- 
variance of 10 per cent, or less, i.e. very high specificity, the same 
factors being present in the non-verbal group tests. 

Factorial Studies of Performance Tests. Morris (1939) gave 
the Pinter-Paterson tests and a series of manual tests to a some- 
what selected group of fifty-five boys aged 9|-. He points out 
that many of the performance test inter-correlations are so low, 
sometimes even negative, that they cannot all be measuring the 
same ability, and claims that the three rotated factors which he 
extracted correspond to Thurstone’s Space, Induction and Per- 
ceptual Speed. But his results, which are very unreliable with 
small numbers, hardly appear to support this identification. In- 
stead they show a general factor with loadings of ■ 5 to • 6 for most 
of the better performance tests and around * 3 for the manual tests, 
and a bipolar factor contrasting most of the tests in these two 
groups. Teegarden’s figures, mentioned above, also show con- 
siderable positive correlations between two formboards (the Kent- 
Shakow Industrial and Minnesota Spatial Relations) and manual 
dexterity tests in adults. Similarly among Attenborough and 
Father’s mentally defective boys, there was overlapping of Healy 
II, Knox Cube, Porteus Mazes, Cube Construction and Dearborn 
Formboard with Binet and Otis on the one hand, and with 
Stenquist Assembly and various dexterity tests on the other hand. 
Vernon’s (1937) study of the Moorrees Formboard indicates that 
its six difficult insets provide a moderate test of^, while its six easy 
insets measure a rather different ability — probably a kind of man- 
ual dexterity. 
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Thomson (1940) analysed the performance tests used in the 
Scottish individual test survey of eleven-year-olds and obtained a 
general factor, present also in Stanford-Bmet, together with some 
indication of a separate group factor in the speeded tests. But as 
the battery only contained one g ox g v test, a distinct perform- 
ance factor could not be expected. Burt (1940b), however, showed 
that additional mmor factors were present in the performance 
tests, one for speeded tests, one for pictorial and a third Imguistic’ 
factor in tests with elaborate verbal instructions such as Knox 
Cube, Kohs, Cube Construction and Binet. 

Earle and Milner (1929) applied numerous tests to over 300 
13-14-year children, and found that they could be classified into 
three main groups: 

(1) Stanford-Bmet and Group Test 34 — g + v tests with high 

loadings. 

(2) Cube Construction, Dearborn Formboard, Form Relations 

and Memory for Designs — ^performance and paper- 
and-pencil tests — also Stenquist Assembly. These were 
partly dependent on g, but showed additional overlap 
suggesting a spatial-practical-mechamcal group factor, 

(3) Other performance tests mcluding Cube Imitation, Substi- 

tution, Picture Completion II and Porteus Mazes— ^ 
tests with low saturations and no other common factors. 
The mean correlations within and between these groups 
are shown in Table XIIL 

TABLE XIII. EARLE AND MILNER’S CORRELATIONS BETWEEN 
DIFFERENT TYPES OF TESTS 



1 

2 

3 

1 

(• 80 ) 

•38 

•38 

2 

•38 

(• 46 ) 

•28 

3 

•38 

•28 

(• 33 ) 


That the ^-loading of performance tests changes with age is 
suggested by Arthur’s (1930) correlations of *81 at six years and 
• 26 at 14 years between her battery and Stanford-Binet. We would 
suggest however, that this is merely due to most of her tests 
approaching their ceiling among older children, and to the increas- 
ing verbahty of the Stanford-Binet. More appropriate performance 
tests do continue to provide a moderate measure of intelligence^in 
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adolescents and adults. Even in an extremely high-grade group of 
255 Civil Service candidates, the writer found a ^-saturation of 
about ‘60 for the Kohs-Misselbrook Block test. Balinsky’s (1941) 
results indicating considerable general factor saturations for the 
Wechsler-Bellevue performance tests from 9 to 60 years, and 
doubtful alterations with age, have already been mentioned (p. 29). 

Alexander’s F and the ii-Factor, Alexander’s investigation 
(p. 18) proved that some performance tests do measure a factor 
beyond g. In his adult group, Picture Completion and Formboard 
tests actually obtained larger loadings on his F (practical) factor 
than did the three tests chosen for his performance scale, and 
Porteus Mazes much the same loadings. The three younger 
groups took three of the pictorial tests which Cox devised for 
measuring m (p. 101). In the youngest groups they appeared to 
act mainly as g tests, but among the older youths they approxi- 
mated quite closely to the F tests. Possibly because of their com- 
plicated instructions they also showed small verbal loadings. These 
results, and others quoted below, suggest that F is identical with 
the spatial factor k, and that Cox’s m is largely composed of k also. 

The ^-content of performance tests such as Alexander’s is a 
matter of considerable importance in attempting to differentiate 
‘academic’ and ‘practical’ types of children at 11+ , Burt (1947) 
suggests that the correlation between reliable intelligence and 
performance tests is about 6, making differentiation very difficult. 
Alexander (1947) claims that it is nearer *3, though a table 
published in the Instruction Book of his battery, listing verbal and 
performance test LQ.s, yields a correlation of *54, as Dempster 
(1948) points out. Alexander considers too that jF is quite distinct 
from the k factor in paper-and-pencil spatial tests. He relies 
mainly on the evidence of Drew’s (1947) factorial investigations, 
whose accuracy has been called into question by Slater (1947). To 
the writer, the main fallacy in Drew’s work is his identification of 
F with Passalong scores. This is the least reliable test in Alexander’s 
battery, and when Kohs Blocks and Cube Construction are also con- 
sidered, the identity ofF with A is obvious. Thus in Drew’s 11-12-|- 
groups of 353 boys, the correlations appear to be accounted for by; 

(1) I'- 

(2) a verbal group factor in the Moray House test, Spearman’s 

V test and teachers’ verbal ability assessments, 
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(3) a spatial factor in the Alexander tests and Spearman's k test, 
possibly also in a non-verbal g test and teachers’ assess- 
ments of practical ability. 

Drew’s results for eighty-eight techmcal school boys (16+) 
are more complex because the boys were clearly selected for g and 
v,ed, and because of the unreliability of the coefficients. But the 
writer’s re-analysis yields: 

(1) a k factor in the Alexander tests, a formboard, a spatial test 

and a non-verbal g test. This enters too into Technical 
Drawing and Shopwork marks. 

(2) a verbal factor in Simplex and the v tests, and in marks for 

Enghsh and Science. 

(3) A separate scholastic or X factor in all four sets of marks. 

Emmett (1949) also re-analysed some of Drew’s figures, and 
found a common factor in Alexander’s battery, a k test and a non- 
verbal test. Wilhams (1948) included the Alexander scale in an 
analysis of verbal, mechanical, spatial and non-verbal^ tests among 
250 twelve-year boys. The v, m and k tests gave distinctive group 
factors, and the Alexander tests fell in the same cluster as the k 
ones. There was no trace of a performance factor distinct from 
paper-and-pencil abilities. Dempster (1948) reports similar 
results with nmety-one 11 -year boys, though he quotes no 
figures. Price (1940) worked with only eighty-five University 
students, but his results are similar. He gave three verbal and non- 
verbal g tests, three spatial tests, Kohs Blocks, Passalong, Dear- 
born Formboard, Cylinder Construction and Woolley’s (Blind- 
fold) Formboard. After extracting a general factor, a bipolar 
factor was found which separated the g tests, Passalong and 
Woolley from the spatial and the other performance tests. Pre- 
sumably then Passalong and Woolley are the poorest measures of 
the spatial factor in high-grade adults. The residuals, which are 
statistically insignificant, do provide a very famt trace of a dis- 
tinction between paper-and-pencil and manipulative tests. 

The only relevant evidence collected in the Services was in a 
group of 500 air mechamcs, who took the Trist-Misselbrook 
revision of Kohs Blocks. This obtained almost identical loadings 
on three unrotated factors to those of the Squares spatial group test. 

Conclusion. Before proceeding to mechanical tests it would be 
well to summarize the position. Many of the conomonly used per- 
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formance tests are unreliable, or embody large specific factors, 
hence it is unsafe to use any battery or scale as a measure of g. The 
correlation of such a battery with Bmet or group verbal tests is 
further reduced by the considerable ^-element in such tests. 
Nevertheless, it is likely that g is the major common factor in all 
except the simplest manipulative performance tests, hence it is 
impossible to differentiate effectively any large proportion of 
children, at 11+, into academic and practical types. Most of 
the more thorough performance tests, Kohs Blocks and Cube 
Construction in particular, bring in a fair amount of the k factor 
which is measured by spatial group tests, and which is closely 
linked with the m or mechanical factor. Though there is little 
direct evidence, it is probable that some of the simpler ones depend 
too on a manual dexterity factor or factors. If large enough num- 
bers of subjects are tested with a varied battery, minor group 
factors begin to emerge among different types of test — ^pictorial, 
speeded, etc. But there are no grounds for assuming a broad per- 
formance or practical factor distinct from g, k, m and dexterity. 
Hence, apart from the dexterity element and the greater attractive- 
ness of performance tests to testees, there is no reason why psycho- 
logists should not substitute the more reliable and convenient 
paper-and-pencil mechanical and spatial tests for performance tests. 

Mechanical Tests* Murphy (1936) suggested that tests claim- 
ing to measure mechanical ability are so variegated that the exist- 
ence of a distinctive common factor is doubtful. On analysing a 
battery of eighteen tests (cf. p. 103), she found that a spatial rela- 
tions fector and a hand-eye co-ordination factor would account for 
all their inter-correlations. While this may be true of the Mac- 
Quarrie tests, it is unfair to tests that do involve comprehension 
or assembling of actual mechanisms, none of which was included 
by Murphy. We would agree that a large portion of what is com- 
monly called mechanical ability actually consists of k\ this may be 
deduced from the Minnesota study and Alexander’s, Earle’s, 
Milner’s and Macrae’s work. Slater (1940) too analysed the Cox 
and Vincent models tests in his study of spatial tests among 
apprentices, and found no separate m component. Again it is 
interesting that the sex difference in k holds also for mechanical 
tests. Indeed Williams’s research, which included Cox’s Designs, 
revised Bennett Comprehension and Vincent Models, is the only 
one to show any clear differentiation. 
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Mechanical Information Constitutes a Distinctive Factor. 

In several investigations in the Services, Test 2 Mechanical 
Comprehension and Test 4 Squares Spatial obtained much the 
same factorial composition (cf. Table But when additional 
spatial and mechanical comprehension or information tests were 
analysed, as in Table IX, a separation emerged between these two 
types. Table VII similarly indicates partially distinct k and 
mechanical information group factors. The different factor load- 
ings of Mec-B, a picture test based on comprehension of the work- 
ings of machines and Mec-C, a straightforward information test, 

TABLE XIV ALTERNATIVE GROUPINGS OF THIRTEEN 
TESTS ANALYSED AMONG 283 R.A.F FITTERS 

I'Gen-A Verbal 
J Arithmetic 

(g> 5RAF Matrices 

( Group Test 80 Spatial 
K-6 Spatial 

Vmcent Models 
Fmger Dextenty 
Mec-A Bennett Mechamcal 

/SP 103 Wirebendmg 

J Mec-B Mechanical Diagrams 
‘ Mec-C Mechamcal Information 
117M Mechanical Information 
m 17E Electrical Information 

are noticeable. Guilford (1948ab) claims indeed that the only 
distinctive element in any mechanical tests, apart from spatial, 
dexterity and other factors, is one of information. The difference 
between k and information is further underlmed by the rather 
disappointing results obtained with k tests in selecting mechamcs 
in the British Services, and the comparatively good results ob- 
tained with information tests (Vernon, 1947b). This may of course 
be due partly to the inverse effects of age on the two types of test 
(cf. p. 33). 

Analyses of Mechanical Tests. The complexity of over- 
lapping between g, k, m, and dexterity, and the extent to which the 
predilection of the factorist may affect the grouping, is neatly 
illustrated by a research m the R.A.F. by Wheeler (1948). The 
tests listed in Table XIV were given to 283 airframe fitters, that is 
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a group with some previous mechanical traimng. Several factorial 
techniques were applied and the tests fell equally well into the 
four groups or the three groups shown here. In the four-group 
solution, G-5 Matrices goes with the k tests, information tests are 
distinct, and Bennett Mechanical Comprehension + two mani- 
pulative tests yield a separate mechanical factor. But the alter- 
native analysis groups the k tests (excluding G-5) with mechanical 
comprehension, and contrasts them with information + Wire- 
bending; while G-5 goes with the g -\-v tests. In this study the 
general factor (a mixture of g and k,m) covered about 28 per cent., 
and the combined group factors 20 to 22 per cent, of variance. 

The American naval classification battery of twelve tests was 
factorized by Peterson (1943) to yield three factors called Verbal, 
Mechanical-Spatial and Quantitative Reasoning. This solution 
was unconvincing, and a re-analysis by the writer gave a more 
logical picture which is entirely congruent with British results. In 
addition to a prominent g, a verbal group factor appears in Reading 
Comprehension, Opposites, Analogies, Completion and Arith- 
metic, and a k:m factor in the remaining seven tests. Two minor 
group factors are needed, one in the four mechanical and electrical 
information measures, and one in Block Counting, Mechanical 
Comprehension and Surface Development (k) tests. 

Mechanical Information Group Factors. A sub-grouping 
in the information field was suggested by the writer’s analysis of 
six tests among 136 naval radio and electrical mechanics. A 
prominent general factor of electrical-mechanical knowledge 
accounted for 43^ per cent, of variance in Bennett’s Mechanical 
Comprehension test, Tests 117 E and M of everyday electrical and 
mechanical information, an oral test of trade electrical knowledge, 
a test of knowledge of radio symbols, and Test 3b Mathematics. 
Though there was no differentiation between electrics and 
mechanics, a bipolar factor with 5 per cent, variance contrasted 
the mathematically more advanced Test 3b and Radio Symbols 
with the more practical or applied information in the oral trade 
test, 117M and Bennett. A similar study of eight comprehension 
tests was carried out in the U.S.A.A.F. by Davis (1947). His main 
bipolar factors contrasted the more verbal tests (Technical Voca- 
bulary, Electricity and Knowledge of Tools) with a test of Prac- 
tical Experience in Applying Mechanical Devices, and with tests 
in Principles of Mechanics and Mechanical Movements. If wo 
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were to measure more specialized knowledge of different branches 
of engineering, we should doubtless find further group factors 
separating, in the same manner as the secondary school or univer- 
sity subjects mentioned in Chapter IV. 

Manipulative Tests. Assembly or other manipulative me- 
chanical tests tend to give very high correlations with the com- 
plex. In five analyses which included Bennett and Squares paper- 
and-pencil tests, and Army Assembly, Meccano Assembly or 
Wirebending, the mean loadings on of the former were *35 
and of the latter, practical, tests -76. Their respective mean g- 
loadings were -59 and -25. It seems likely therefore that, just as 
the F of performance tests is covered hyk, so assembly tests involve 
no factor which is not measurable by paper-and-pencil tests of k 
and information. We would allow that assembly tests, like some 
performance tests, may bring in a small dexterity component. 
The Minnesota figures (Table X) suggest that if there is any addi- 
tional mechanical manipulative factor among boys, it is quite 
small. Thurstone (1948) is engaged on an extensive research into 
the structure of mechanical ability, and his findings on this point 
will be awaited with interest. 

One relevant analysis was made among 130 Army recruits under 
training as driver mechanics. Table XV shows the unrotated 


TABLE XV. CENTROID AND GROUP FACTORS AMONG 
MECHANICAL TESTS APPLIED TO ARMY DRIVER MECHANICS 


Test 

Centroid Factors 
(unrotated) 

I II III 

General 

g+km 

Group Factors 

2 Bennett Mechanical 

•654 -154 -318 

'662 

‘339 

4 Squares Spatial 

•539 -258 ^089 

'521 

•339 

Meccano Assembly 

•876 -141 — 266 

•924 

•333 

Wiring Dextenty 

•551 -291 — 155 

•531 

•333 

Mechamcal Information 

•724 — 280 -116 

•631 

'450 

Mechamcal Interests 

•232 — 298 — 053 

•136 

'351 

Course Marks 

•795 — 256 —-085 

•707 

'471 

Variance per cent. 

mem 

39'5 



Thurstone factors, and the group factors extracted by Burt’s 
technique. The Wiring test is a complex manual test of joining 
wires to terminals Mechanical interests were measured by a test 
of the Strong type. The general factor is, of course, a mixture of g 
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and k:m, but there appear to be minor group factors, of doubtful 
significance, differentiating paper»and“pencil tests, manipulative 
tests, and measures of interest and information (the latter including 
the assessments of proficiency at the end of the course). 

The finding that paper-and-pencil tests will (provided allow- 
ance is made for their higher ^-loading) dover almost the same 
ground as practical performance or mechanical tests is of great 
importance to vocational psychologists. Practical tests naturally 
possess greater ‘face’ validity and may be justifiable on the grounds 
of their attractiveness to candidates and acceptability to employers. 
Yet it is a waste of time and money to use them if their statistical 
validity is not superior. In the investigation just described, the two 
manipulative tests combined correlate *674 with the driver 
mechanics’ course results, and the four pencil-and-paper tests 
correlate *693. 

Many psychologists, including the writer, like performance, 
assembly, or other practical tests not so much because of the pre- 
dictive value of their scores, as because the testee’s method of 
tackling the problems, his interest and concentration, etc., appear 
to give valuable qualitative clues (cf. e.g. Oakley and Macrae, 
1937 ). Here we have the same situation as with the qualitative 
tests that clinical psychologists apply to neuropsychiatric patients 
(cf. p. 57 ) There may well be ‘something in it’, but if so, the 
qualities indicated by the tests, over and above the actual test 
score, should be shown to constitute reliable and consistent group 
factors. And so far no one has done this. 

Effects of Training. It must be recognized that the patterning 
of k:m abilities depends to a considerable extent on the experience 
and training of the testees ^ An analysis of post-war regular entry 
naval air mechanics, all of whom had some trade experience, in- 
cluded the standard naval Tests 1-4 and 74 , four Recruiting 
Centre tests, and an earlier Recruiting Centre battery of intellig- 
ence, arithmetic and dictation tests. Table XVI shows clear 
mathematical and dictation group factors, but there is no true k:m 
factor at all. Instead a technical education factor combines mathe- 
matics and the comprehension + spatial tests. 

In another experiment parallel analyses were made of nine tests 
given to groups of 240 15-year and 18-year naval artificers. The 
younger group was just starting, the older group had nearly com- 
pleted, its engineering training. In Fig. 6 the first bipolar factor 
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TABLE XVI. ROTATED CENTROID FACTORS AMONG TESTS 
APPLIED TO 312 NAVAL AIR MECHANICS. (Loadings less than -10 

omitted). 


Test 

Tech. 

g Educ- Anth. Diet. 

h^ 

SP Test 1, Abstraction 

•76 

•58 

RD Test, Abstraction 

•78 

•61 

Recnntmg Centre Verbal Intelligence 

•43 

•19 

SP Test 4, Squares Spatial 

42 *11 

•20 

SP2, Bennett Mechamcal Comprehension 

•47 *41 — 11 

•42 

RA Test, Mechanical Comprehension and 



Information 

•45 -40 

•37 

SP3a, Anthmetic j 

•29 — 22 -59 -15 

•50 

SP3b, Mathematics 

•59 -37 *53 

•77 

Recrmtmg Centre Arithmetic 

•56 -24 -56 

•70 

RB Test, Mathematics 

•63 -14 -59 

•76 

SP Test 74, Dictation 

•42 -57 

•50 

Recrmtmg Centre Dictation 

•24 -60 

*42 

RC Test, Spelhng 

•50 -68 

*73 

Variance per cent. 

27-8 4*7 10*1 9-3 

51-9 


H 


•97 

•4 

•3b 

•WB 

.2 


•Cojr 

.M ^ 


•E 



15 year group 


n 


•4 

•97 

•/ 

•3b 

•Cox 

•WB 

•E 

‘2 -M 


18 uear 


year group 


Key to the Tests 

1 — ^Abstraction Cox — Cox Mechanical Models, A — Squares Spatial 

2 — ^Bennett Mechanical M — ^Mechanical Information 97 — ^Memory for Designs 

36 — ^Mathematics E — Electncal Information. WB — ^Wirebendmg. 

Fig, 6. Graphs of Factor Loadings in 15-year and 18-year 
Artificer Apprentices 
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for the younger group chiefly contrasts educational tests like 3b 
with technical information tests M and E. In the older group this 
factor has almost vanished and the main distinction, running from 
R to L instead of from top to bottom, is between technical educa- 
tion including mathematics and information on the one hand, and 
practical ability, chiefly represented by Wirebending, on the other. 
The technical education factor is presumably the same as that of 
Table XVL The spatial tests (4 and 97), Abstraction Test 1 and 
Mechanical Comprehension Test 2 stay in much the same posi- 
tions in the two graphs; but 3b, M and E move towards one 
another, and Wirebendmg moves outwards — m other words 
manipulative skill becomes more distinctive. Cox Models is a 
comprehension test at the younger age, but apparently becomes a 
more practical one later. Fig. 6 omits the first factor loadings of 
the tests, since they were closely similar in the two age groups. 
There was no evidence of greater differentiation of abilities, or 
lowered influence of ^ with age (cf. p. 30). 

Another instance of alterations in structure with experience 
and/or with level of ability is offered by the analysis of tests given 
to African recruits (p. 106). The results were congruent with those 
obtained in this country, but showed closer integration of me- 
chanical, manual and physical abilities on the one hand, and of 
intellectual {g) and educational abilities on the other hand. Fur- 
ther differences have been found in this country between men 
and women. 

Analyses Among Women. Six main analyses were carried out 
on A.T.S. tests, three with representative populations of women 
recruits, one with a lower grade group of cooks, and two with High- 
grade groups of motor mechanics and special operators. In the 
two latter the results were quite similar to those of men, except for 
the restriction in variance of g, due to high selectivity Table XVII 
shows the group-factor analysis loadings for special operators. 
Note that Spelling (SP 14) and Educational Standard have 
especially low ^-saturations, and that Tests 2 and 4, but not 
Matrices, measure zk»m factor. Among the motor mechanics the 
most interesting finding was the poor mechanical loading of a 
Models test. The A.wz-saturations were: Matrices * 12, Bennett 
•44, Squares *47, Assembly 75, Meccano Assembly *64, Vincent 
Models T7. 

Much greater difficulty was experienced in arriving at acceptable 
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factorial solutions among normal women (cf. Table XVII) and 
cooks. Not only did the Clerical and Spelling tests appear to 
obtain unduly high ^-loadings, but Bennett Test 2 had extremely 
low g and k:m\ presumably it measures a very largely specific 
factor in women. Further the Matnces test could not be differ- 
entiated from the k:m tests. Note the similarity of this result with 
a non-verbal g test to the results obtamed by Slater among 
11-13 year children, and to those of Mellone and McFarlane 
(pp. 68, 107), Either we must admit then that tests such as 
Matrices do depend largely on k:m in women — and this is the 
solution adopted in Table XVII — or that the k:m factor has hardly 


TABLE XYIL GROUP-FACTOR ANALYSES AMONG 200 REPRE- 
SENTATIVE A.T.S, RECRUITS AND 200 SPECIAL OPERATORS 


Test 


Normal Group 

Special Operators I 


g 

V ed 

k m 

h^ 

g 

V ed km 


Progressive Matnces 

•75 


•32 

67 

•68 

•00 

•46 

2 Bennett Mechanical 

•41 


•29 

■26 

•43 

•31 

•28 

4 Squares Spatial 

•53 


•51 

•55 

•38 

•50 

•39 

24 Meccano Assembly 

•43 


55 

•49 

— 

— 

— 

14 Spelling 

•64 

51 


67 

•16 

•74 

•58 

23 Arithmetic 

•68 

•34 


•58 

•49 

*51 

•50 

12 Clencai 

•81 

•26 

1 

•72 i 

•38 

•52 

•41 

17, 25 Verbal 

•66 

53 


•71 

•52 

•42 

•44 

Educational Standard 

•57 

•45 


•53 

•23 

•51 

•32 

Variance Per cent. 

38-8 10*2 

8-4 : 

57-4 

18-9 18*9 4-3 

42-2 


differentiated in Test 2 and only partially in Test 4 Squares and 
24 Assembly. The second view would not only mean that spatial- 
mechanical tests measure hardly anything but g, but also that the 
v:ed tests would obtain smaller ^-loadings than in men and larger 
v:ed loadmgs. A factorial investigation by Banks (1949) of large 
groups of Army and A.T.S. recruits confirms the first solution. 
Probably the main reason for the difference between men and 
average women is that the SP non-verbal tests give an even less 
adequate sampling of women’s practical abilities than they do of 
men’s. Had there been tests of, say, domestic and social abilities 
to constitute a contrasting pole with the v -ed tests, a more accept- 
able patterning of abilities might have been found. 

Conclusions. The evidence of this and the precedmg chapter 
indicates that we can hardly expect to discover a structure within 
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the k:m complex as definite as that existing in the verbal-numerical 
field, since so much depends on the experience, training and back- 
ground of the testees, and on the extent to which they are selected. 
Psychologists recognize nowadays that muscular development in 
the infant is neither differentiation of specific skills from general- 
ized mass movements, nor the integration of elementary reflexes 
into complex habits, but a combination of both. Similarly we may 
expect group factors in mechanical ability to differentiate when 
specialized functions are practised, and to reintegrate into fresh 
patterns. Since considerable growth of k:m occurs in the average 
male adolescent (cf. Vernon and Parry, 1949), it is probably 
accompanied by alterations of structure, depending on the skills 
trained in his employment or hobbies. And as any group of sub- 
jects large enough to yield significant factors is likely to be ex- 
tremely heterogeneous in experience and training, such factors 
cannot be very clear-cut and cannot reveal much information 
about the abilities of any particular individual. Vernon and Parry 
(1949) point out also that k and m tests are considerably more 
useful in predicting trainability for mechanical jobs at 13-15 years, 
and among women, than among male adults. This too would be 
expected if k:m tends to differentiate and to become more com- 
plex in structure in men after IS. 

We should not expect investigations of the kind summarized 
above to throw much light on the problem of whether or not 
mechanical aptitude is innate. It would be foolish to deny the 
possibility when other aptitudes such as musical clearly possess an 
innate component. But we should recognize that most of what the 
layman implies by mechanical aptitude is attributable to g, to 
training and experience, and to temperamental and interest factors. 
So far there is no evidence that any person is untrainable in any 
practical job through lack of aptitude, provided he has sufficient g, 
persistence and interest, and is not suffering from nervous or 
muscular disease. 



CHAPTER XI 


OCCUPATIONAL ABILITIES 

Abstract, A prominent general factor of aptitude for all jobs 
appears to be compounded of gy v:edy and a drive component 
closely related to X, Beyond this, job abilities group into the more 
bookish and the more practical, and possibly other major types. 
But the scope of these group factors is probably small compared 
with that of minor group or specific factors. Hence transfer of 
ability from one specialized type of work to another is limited. 
Also the overlapping between factors established among tests and 
job factors is disappomting, so that vocational guidance (as dis- 
tinct from selection) cannot hope to advance very far merely by 
trying to predict aptitude for general types of work from scores 
on a battery of tests. 

Trainability — General or Specific. It is much more difficult 
to map out the main factors in vocational than in educational 
abilities, for the obvious reason that people do not work at several 
jobs simultaneously as they do at several subjects of study. More- 
over, the persons in any single job tend to be more selected, either 
for intelligence or trade experience, than school pupils. However, 
several lines of investigation have been partially explored. 

During the war it was possible to observe the trainability of men 
and women from different civilian occupations in different Service 
jobs. The importance of previous direct experience, i.e. of small 
group or specific job factors, was evident among drivers, tele- 
graphists, clerks, and many types of tradesmen (cf. Vernon and 
Parry, 1949) For example, the correlation between previous 
driving experience and success at Army driving was higher than 
that of any battery of psychological tests. It is fair to state also that 
transfer effects were very linuted, i.e. that experience in other 
superficially similar jobs was often of little or no value, though 
urdfortunately no really reliable quantitative data are available. 
Thus machinists and fitters' and turners' mates did not do any 
better as naval ordnance, or engine room mechanics than did 
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men from non-engmeering occupations such as retail tradesmen, 
and no type of mechanical background appeared to assist radio 
or electrical mechanics. Much more influential — though this is 
difficult to prove — ^was a kind of general factor comprising in- 
telligence and educational level and personality traits such as 
keenness, drive, seriousness of purpose. 'For example, clerks and 
policemen who tended to be high in such a factor did well at almost 
any job to which they were allocated, and usually surpassed men 
with a more practical background. Again it often happened that 
recruits who failed in one Service job had to be reallocated, and it 
was usually found necessary to move them to a job requiring lower 
general ability and application. If they were transferred to 
another job at the same level, in which they claimed some interest 
or experience, only too often they failed again. The layman’s 
notion that there exists a niche or special type of work ideally 
suited to the specialized aptitudes of each individual appeared to 
be much less true than the view that all types of work and all 
employees fall along a single high-grade to low-grade continuum. 
The success of women workers at skilled engineering jobs during 
the war further supports this view, which has also been advocated 
recently by Philpott (1947). Admittedly the criterion of success 
was in most instances one of rapid trainability, and different re- 
sults might have followed had ultimate proficiency been measur- 
able, But so far as our rather patchy and impressionistic evidence 
goes, the situation seems to be very similar to that in education, 
namely that there is a strong general ability factor compounded 
partly of ^ (or ^ ^ partly of a temperamental or interest 

factor akin to AT, and that beyond this group factors or general 
types of vocational abilities are disappointingly small and minor 
group or specific factors relatively prominent. Though the 
industriousness or purposiveness factor is obviously complex 
and is, for example, affected by the recruit’s morale or attitude to 
Service life, yet it is certainly closely related to our X factor. Per- 
sonnel Selection Officers Wnd that school and work record 
provided a most useful indicator, and in at least two experiments 
(Lummis, 1946; Vernon and Parry, 1949, p. 141) direct corre- 
lations were obtained between measures of educational ‘drive’ and 
proficiency in the Services. 

Common Factors in Job Analyses. A second approach is 
that of Coombs and Satter (1949), who had fifty-four jobs in a 
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large paper mill assessed by job analysis experts for the skills, 
knowledge, etc , required. The twenty jobs were then taken which 
showed fewest common elements, namely thirty-five to seventy- 
four out of a possible 104 elements, and correlations were denved 
from the numbers of elements and factorized After rotation a 
clear-cut general + group factor pattern was obtained. The 
general factor, covering 43-4 per cent, of variance, represented 
common features or requirements in all types of job. Four 
‘families ’ of jobs, with combined variance 17*0 per cent., included: 

A. Self-responsible jobs, e.g. employment interviewer, librarian 

supervisor. 

B. Routine entry occupations, e g. messenger, receptionist, 

nurse, shop clerk. 

C. Skilled manual, e g. multigraph, multilith and key punch 

operators. 

D. Clerical. 

The method might fruitfully be extended to a wider range of 
jobs, and would no doubt yield additional types. The authors 
point out that the size of the general factor largely depends on the 
skill and objectivity with which the assessments are made But as 
especial care was taken in this study to avoid subjectivity, and as 
only the most independent jobs were factorized, its large variance 
(relative to the group factors) confirms the conclusions of the 
preceding paragraph. 

Analyses of Training Marks. Obviously it would be better to 
analyse the actual skills of groups of workers, and useful evidence 
was collected m the Services when trainees were marked or rated 
for several different sections or aspects of an occupation. The 
major differentiation between v:ed and k.m types of work recurred 
as constantly as it does in the fields of education and of psycho- 
logical tests. Thus among naval, Army and A.T.S. signallers and 
telegraphists, ability at theory or bookwork is partially distinct 
from ability at Morse. Similarly, informational attainments among 
clerks contrast with typewriting and stenography marks, technical 
acquirements among R.N.V.R officer cadets with personality 
quahties, and so on. Table XVIII shows three (unrotated) factors 
extracted from eight sets of marks awarded to 250 naval engine 
room mechanics. A fairly prominent general factor runs through 
all the marks, but the bipolars indicate two group factors: 
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(1) Written general paper and electricity — predominantly theor- 

retical subjects. 

(2) Sheet metal work and fitting. 

The remaining four subjects all give different factor patterns. 

TABLE XVIII. CENTROID FACTORS IN THE COURSE MARKS 
OF ENGINE ROOM MECHANIC TRAINEES 


Subject 

I 

11 

III 

h' 

Written theory paper 

•32 

— 31 

— 40 

•36 

Electncal Equipment and Wiring 

*56 

— 56 

— 07 

•63 

Sheet Metal Work 

•63 

•34 

•08 

•53 

Precision fittmg 

•66 

•39 

— 05 

•59 

Garage 

•77 

— 09 

— 01 

•62 

Engines 

•74 

— 10 

47 

•77 

Oxy-acetylene welding 

•49 

•08 

•27 

•32 

Centre Lathe 

•40 

•25 

— 31 

•32 

Variance Per cent. 

34-9 

9-7 

7-1 

51-7 


Sets of marks awarded to trainees in seven other branches such 
as signallers, telegraphists, R.N.V.R. officer cadets, and instructors 
showed much the same degree of overlapping, i.e. the same factor 
variances. Naval visual signalmen, for example, yielded distinctive 
group factors for theory or bookwork, for Morse receiving and 
transmitting, and for visual signalling. With leading torpedo- 
men and electrical mechanics there was somewhat greater 
specificity, only 38 per cent, of variance being covered, probably 
because the duties in which they are examined are unusually 
diversified. Among electrical mechanics there were small group 
factors representing: 

(1) Mming and Workshop. 

(2) Gyro Compass and Whitehead (Torpedo). 

(3) Schoolwork, Preliminary Electricity and Low Power 

Electrical Equipment. 

Analysis of Objective Measures of Workshop Ability. Now 
all training marks, even when awarded by independent examiners, 
are liable to be influenced by the examiners’ opinions of the 
trainees as individuals, their industriousness, alertness, etc. 
When objectively marked capacities are compared the generality 
is considerably reduced. A detailed study was made of twenty- 
seven sets of marks awarded to naval electrical mechanics during 
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the SIX months’ workshop training and theorj" course, most of them 
being based on standard test pieces in fittmg, turning, etc. While 
seven sets of theory examination marks gave unrotated factors 
with variances 46 * 1, 8 • 1 and 4* 1 (total 58 *4 per cent.), the average 
result for three analyses of objective practical marks was 28-6, 
10-6, 4*7 (total 44*1 per cent ). If the bipolar factors were con- 
verted into group factors, we should probably find that the group 
factors, corresponding to different types of job, are more prominent 
than general workshop ability. A specimen analysis of fitting jobs 
is shown in Table XIX. The second factor differentiates the first 
month’s practice jobs from the second month’s jobs and tests. 

TABLE XIX* CENTROID FACTORS IN WORKSHOP PROFICIENCY 
MEASURES AMONG 122 NAVAL ELECTRICAL MECHANICS 


Marks 

I 

II 

III 

h* 

Job No. 5478 

•55 

-19 

•15 

•36 

» 5848 

•80 

•37 

•08 

•78 

„ „ 5001 

•59 

•42 

—•21 

*57 

„ „ 5050 

•62 

•13 

—20 

•44 

5867 

•47 

— 36 

•35 

•48 

„ 5838 

•54 

— 20 

•49 

*57 

Practical Test 1 

•46 

— 26 

— 28 

•35 

.. 2 

•54 

— 28 

— 38 

•52 

Variance per cent 

33-8 

8-4 

8-8 

50*9 


The third differentiates Nos. 5478 and 5848, which involve angles 
other than right angles, from Nos. 5001 and 5050 which involve 
right angles only. No. 5848 which brings m the greatest number 
of different angles has the highest general factor loading and 
communality. The third factor also separates the second month’s 
precision jobs from the tests which were done under examination 
conditions without any advice from instructors. Thus different 
conditions of work as well as different types of job affect the cor- 
relations. Turning, shaping, thread-cutting, and other operations 
similarly yielded partially distinct factors in other analyses. The 
contrast between theory and workshop analyses is even more 
striking when it is pointed out that the trainees were extremely 
highly selected for intelligence, mathematical and electrical know- 
ledge, whereas they were unselected (apart from good Test 2 and 
4 scores) on the practical side, and were drawn from a great 
variety of — mostly non-mechanical — occupations. 
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The conclusion follows, then, that though a general ability at, 
or aptitude for, mechanical work does exist, quite apart from g, it 
IS of small extent when objectively assessed, and factors specific 
to the particular type of operation or machine are moie important. 
Each test job, occupying the trainees several days, may be aptly 
compared to a single mathematical test item, in reliability and in 
overlapping with other jobs or items. Hence, a very extensive 
sampling of jobs over months would be needed to yield a reliable 
objective criterion of workshop ability, although doubtless an 
experienced instructor could arrive at a fairly reliable — ^yet more 
subjective — estimate in a briefer time. 

The only comparable study in the literature is that of the 
Minnesota investigators, who unfortunately do not report the 
overlapping between the various trade operations learned by their 
subjects. They do, however, mention that the correlation between 
the first ten weeks (fifty hours) of the boys’ shopwork and the next 
twenty weeks was only -42, which agrees well with our figures. 

Test Factors and Job Factors. How far test factors and job 
factors overlap it is difficult to say in the absence of investigations 
where proficiencies at several types of work are compared with 
numerous psychological tests. One such study was conducted by 
Wheeler (1948) among R.A.F. airframe fitters Analysis of course 
marks alone yielded a general factor with SO per cent, and two 
bipolars with 15-5 per cent, of variance. The latter difterentiated 
the marks awarded under examination conditions from those given 
during the course, also the more practical from the more theore- 
tical marks. So far this duplicates our results quoted above for 
engine room and electrical mechanics. When the marks were 
analysed with thirteen psychological tests (cf. Table XIV) the 
most ‘practical’ tests 117M, Wirebending and Dexterity grouped 
with the course marks against the other ten tests; and a second 
bipolar contrasted the theory marks and g -1- v:ed tests with the 
practical marks and k:m tests. 

^ Numerous follow-up studies (cf. Vernon and Parry, 1949) have 
similarly shown better validity for v:ed tests in work of a clerical, 
verbal or theoretical nature; k-m tests have obtained relatively 
higher validities in work of a practical nature, though v:ed tests 
have been useful there also. At the same time, however good the 
battery of tests, validity is very far from perfect, indicating the 
presence of the additional' Z factor or factors in jobs. Thus the 
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structure of job abilities appears to parallel closely that of educa- 
tional abilities, and may be roughly represented as in Fig. 7. 
G, X and v:ed together make up general occupational ability; this 
may be partially split into verbal, mechanical-practical and possibly 
other types (e.g. managerial, dealing with people, etc.), which 
themselves sub-divide indefinitely. As in education again the 
general factor and small group or specific factors cover much more 
variance than the major types. 


General Occupational Ability 



Fig. 7. Diagram illusjtratmg the Structure of Occupational Abilities 


We have no sure information as to whether jobs group in accord- 
ance with the other main test factors described in preceding 
chapters. Nevertheless, it is obvious that some jobs require much 
stronger physique than other, so that our physical factor may link 
up with these. In spite of the low validity and specificity of manual 
dexterity tests, they do often yield moderate correlations with 
machine operating and assembly jobs. The Division of Occupa- 
tional Analysis claims that its non-verbal perceptual factor is 
especially relevant to visual inspection jobs in industry, but 
provides no evidence of validity for this, nor for the three psycho- 
motor, factors. It is interesting, none the less, that the Matrices 
test and Group Test 70 (although apparently little dependent 
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upon P) gave far their best validities during the war with visual 
signalling, radar and certain A.A. jobs involving visual perception. 
The relevance of the verbal perceptual factor, Q, to clerical jobs 
is well established. No close relation between the spatial factor 
and jobs that appear to involve k has been established, though it 
does link with mechanical ability m adolescents, and with some 
kind of practical efficiency in adults (cf. Vernon, 1947b). The 
mechanical information factor is much more highly correlated 
with success at mechanical jobs in general, also specialized trade 
knowledge tests with specialized trade skills. 

Conclusions for Vocational Guidance and Selection. It 
would seem that we cannot hope to cover much more ground in 
the prediction of vocational suitability by means of spatial, per- 
ceptual, manual, or by rote memory, attention and other types of 
test than we do already with g, ^:ed and k:m tests. Certainly it is 
worthwhile exploring the relevance of the established test factors 
for groups of jobs. But it appears unlikely that vocational guid- 
ance can ever be reduced to the application of a battery of tests for 
measuring all the mam ability factors, and expecting an individual’s 
pattern of factor scores to tell us what jobs he is suited to, though 
this is the aim of the Division of Occupational Analysis and of 
many American psychologists. In order to tap minor job group 
factors, we should almost certainly have to resort to more elabor- 
ate, expensive and time-consuming work-sample tests. These 
have their place in vocational selection but are hardly practicable 
for guidance. Psychologists giving guidance are justified in making 
the fullest possible use of g, v:ed and k:m tests, but thereafter their 
success is likely to depend chiefly on the extent to which they can 
gauge each candidate’s previous relevant experience, and evaluate 
both his more general motivation or X, and his more specific 
attitudes to the jobs under consideration. 



APPENDIX 


GENERAL + GROUP FACTOR VS, MULTIPLE FACTOR 

THEORIES 

D ivergences between the methods and conclusions of 
British and American investigators in the field of factor 
analysis are much less acute now than they were ten years 
ago. But there is still sufficient difference in their views on mental 
structure to make it essential to show why general + group-factor 
solutions are considered superior m this book. 

Factorial Techniques. It should be pointed out first that 
Thurstone’s centroid technique, being perhaps the simplest to 
apply, is very widely used in Britain. The main difference is that 
we either do not rotate the resulting factors, but use them to 
indicate what group factors are present before starting a true group- 
factor analysis, or else we rotate in such a way as to maximize 
rather than minimize a general factor. Conversely in America, 
Holzinger has always favoured group-factor techniques, and R. B. 
Cattell (1946) regards the hierarchical picture of abilities as 
superior to the multiple factor. There are a few minor differences 
of technique. Some British workers prefer Burt’s original Simple 
Summation to the Centroid method, though the latter has a neater 
device for reflecting signs when extracting bipolar factors. Usually 
also we guess the communalities for insertion in the diagonal cells 
and repeat the analysis several times until the guesses approximate 
to the correct values, instead of — like Thurstone and his followers 
— inserting the highest correlation in each column at each stage. 
Thurstone’s short-cut is liable to exaggerate the communalities, 
and therefore the size and number of the later factors (cf. Burt, 
1938). But Burt’s more accurate successive approximation tech- 
nique is hardly practicable when the number of tests is large, say 
fifteen or more, or the number of factors is large. 

Number of Statistically Significant Factors. A more im- 
portant difference between the methods commonly used by fac- 
torists in the two countries is that British writers often stop at two 
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factors (or g + two group factors) and seldom exceed four, whereas 
Americans seldom extract less than five and often exceed twelve. 
Both agree that factors should satisfy a criterion of statistical 
sigmficance, but American criteria seem to us so lax that many of 
the later factors they obtain from smallish populations must be 
considerably distorted by chance errors. Spearman (1939) 
strongly criticized Thurstone’s original primary factor study on 
these grounds. Unfortunately centroid analysis (unlike Maximum 
Likehhood, Prmcipal Components and some other techniques) is 
mathematically, a rather rough and approximate technique, 
hence its exact sampling errors are unknown. It seems probable 
that the chi-squared tests advocated by Burt (1940) are unduly 
stnct, and that the checks popularly used in America, namely, 
Tucker’s and Coombs’s, are too lax. But Burt and Banks’s (1947) 
most recent formula for the Standard Error of single loadings,^ 
and McNemar’s (1942a) logically derived formula for the signifi- 
cance of residuals, have both been checked empirically and, in the 
writer’s experience, yield much the same results. Guilford and 
Lacey’s criterion is the simplest,^ and though much more rough, 
it too agrees in showing that most British factorists have in the 
past been too cautious, most American ones too indulgent. 

Disagreement Regarding G. There is thus no essential dis- 
agreement on mathematical points, and provided that group-and- 
multiple-factor analyses account equally well for the original 
correlations by means of the same limited number of factors, they 
are equally legitimate. Moreover, many American factorists do, 
like British ones, find a g-factor nowadays, as was shown in 
Chapter II. But it still remains true that British writers make g 
as large as possible, and posit group factors only when the resi- 
duals necessitate them, whereas Americans either introduce g as a 
second order factor, .or, if a primary one is unavoidable, tend to 
minimize it. Again British workers recognize larger or more com- 
prehensive group factors together with sub-factors ‘descended’ 

(1— j 4) ^/ n 

^ S.E. of a loading, r= / where n— number of tests, and 

V N(n — s-f- 1 ) s = the number of the factor. 

The writer would suggest that about half the loadings should exceed twice this 
S.E. for a factor as a whole to be regarded as significant. 

* Namely, that the product of the two highest loadmgs m a factor, regardless 
of sign, should exceed the S.E. of zero r. Unfortunately Guilford often fads to 
adhere to his own rule, hence several of the factors claimed in his U.S.A.A.F. 
researches are certamly not significant. 
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from them, whereas American primary factors more often all 
possess much the same status and variance. Not only do such 
primary factors seem, from our standpoint, to carry some of the 
variance that would be better attributed to g, but also one or more 
of them (usually a Reasoning factor) may consist wholly of g. 

The main arguments are as follows. 

(1) Size of G in Unselected Populations. G is so much larger 
than all other factors put together in unselected populations of 
adults or children that it is psychologically foolish as well as mathe- 
matically difficult to belittle it. It may legitimately be asked why 
representative populations should be taken as a standard, rather 
than selected ones such as college students, among whom it is often 
quite difficult to establish a^. The writer would agree that analyses 
of selected groups like Thurstone’s and Guilford’s are useful for 
bringing up group factors which might otherwise be obscured. 
But Thurstone (1945) himself acknowledges that selection, though 
not affecting the main factor pattern, does distort the sizes of the 
loadings. And he points out that when selection is complex (based 
on several variables) artificial factors may be introduced. In most 
investigations of college students selection is based on a great 
variety of socio-economic, educational and other influences which 
are unlikely to be correlated with any factor other than g. Hence 
the ^-variance only is reduced and no great harm is done. But 
Guilford’s pilots were mostly doubly selected, by the U.S.A.A.F. 
Qualifying Examination and by the Aircrew Aptitude battery, 
that is to say by tests which are themselves more closely correlated 
with some of the tests to be factorized than with others. And this 
is likely to have played havoc with the obtained correlations and 
the resulting factor patterns. True Dudek (1948) has tried to show 
that the same factors do emerge in a group selected by the Qualify- 
ing Examination, and in a group of women pilots, as in an un- 
selected group of candidates. But in fact he gets parallel results 
only for five of the most commonly accepted factors; the smaller 
factors differ considerably in these three populations. Moreover, 
none of these groups had been doubly selected as had many of 
those in which Guilford’s more dubious Reasoning, Integration 
and Spatial factors occurred. 

(2) Greater Stability of G and the Major Group Factors. 
Burt has claimed that group-factor solutions are more invariant or 
stable than primary factor ones, that is less liable to vary in differ- 
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ent populations or with alterations in the battery of tests. 
While this might be difficult to prove, it is clear that g, the major 
group factors and some of the minor ones almost always turn up 
in much the same form in any reasonably representative group. 
In contrast American writers have described at least a hundred 
ability factors of one kind or another, which are only partially 
reconcilable with Thurstone’s, though it is true that Thurstone’s 
own results show greater uniformity. Balinsky’s (1941) investiga- 
tion of the Wechsler-Bellevue scale is interesting in this respect. 
In all his age groups there was a clear differentiation between 
verbal and performance tests, but no other consistent factors were 
found in several successive groups. The Division of Occupational 
Analysis also found considerable variations in their nine studies, 
but the most stable factors were V, N and S. Surely then it is more 
logical to stress the major general and group factors which almost 
always turn up and whose statistical significance is indubitable, 
and to admit that the imnor ones are so dependent on the parti- 
cular set of tests and the heterogeneity and background of the 
particular populations tested that they do not merit the designation 
‘primary’. 

(3) Quickness of Group and Multiple-Factor Techniques: 
Relative Subjectivity. Group-factor analysis is very much 
quicker. Given the correlations between twenty tests the present 
writer can usually perform the analysis in a day, where a controid 
analysis with rotation would take him a week. Banks (1948) makes 
a similar observation in re-analysing some of Cattell’s personality 
data. Even when a preliminary centroid analysis is made in order 
to indicate objectively what group factors are present, the time 
saved on successive approximation to communalities and on 
rotation is very considerable. It must be admitted, however, as a 
counter-argument, that there is rather a large element of sub- 
jective choice in most group factor analyses, even when they are 
guided by centroid results.^ One naturally tends to aim at factor 
patterns which are consonant with those obtained for the same 
tests in previous studies, and may therefore fail to realize that some 
of these preconceptions are wrong. Thurstone would claim that 
rotation to simple structure is objective, i.e. that there is one best 

^ When every test depends on g and one group factor only, and when there are 
at least three such group factors, then group factor analysis can be done com- 
pletely objectively by Burt’s formula (1940a, p.482), or by Holzinger’s bifactor 
method. But such a simple pattern is rather rare. 
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solution to each factorial problem which maximizes the number of 
insignificant loadings. Indeed some factorists such as Cattell 
actually carry out their rotations without knowing which test is 
which. But it is obvious that the majority are less strong-minded, 
and that their rotations are made in the light of their judgment of 
the content of the factorized tests. Centroid analysis may therefore 
be as subjective as group factor. 

(4) Divisibility of Primary Factors. As would be expected 
on the hierarchical theory, more detailed investigation of parti- 
cular primary factors frequently splits them up into smaller com- 
ponents. Examples are provided by Davis’s and Langsam’s studies 
of reading ability, Guilford’s of the spatial factor, Taylor’s and 
Carroll’s studies of V and W, etc. True, Thurstone himself is 
careful not to claim that his factors are ultimate mental elements, 
but if the term primary is used, it is curious that they should be so 
unstable. When Simple Structure is aimed at, there is no ready 
means of indicating that some factors are more comprehensive than 
others, or can include others. Oblique and second order factors 
could indeed cover this situation, but no American factorist has yet 
systematically analysed a varied battery of tests into several higher 
order factors and sub-factors, along the lines of the group-factor 
analyses in Tables V and IX. 

Unless some such plan is adopted it is difficult to see where 
factorization is to stop. Guilford’s work indicates that almost any 
specific factor (in Spearman’s sense) can be turned into a primary 
factor, given sufficient ingenuity in test construction. The view 
was put forward in Chapter III that highly specialized factors, 
which have no appreciable significance for everyday life, are not 
worth isolating. If this is agreed, it necessitates some distinction 
between the more fundamental factors, which are preferably ex- 
tracted first, and the narrower ones. In other words, factorists 
should aim not merely to reduce large numbers of variables to a 
few components that account for their inter-correlations, but also 
to reduce them to the fewest components which will cover most 
variance. 

(5) No Test Measures a Single Factor. In spite of many 
efforts, no psychologist has been able to devise a test which is truly 
‘univocal’, i e. which measures only a single primary factor (apart 
from error variance). Guilford and Michael (1948) admit that in 
order to measure a person’s factor score, it is usually necessary to 

K 
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add ‘suppressor variables’, that is to subtract weighted scores on 
other tests in order to eliminate the unwanted g or other content. 
Why not admit then that all tests do involve g, instead of arti- 
ficially removing it by means of rotation? 

(6) Hierarchy a Statistical Artefact? One argument in the 
opposite direction is that the notion of hierarchy arises merely 
because centroid analysis yields a general factor and a series of 
bipolar factors sub-dividing the tests into smaller groups. This 
might be answered by pointing out that group-factor analysis pre- 
ceded multiple factor in psychology. Moreover, in most analyses, 
g differs considerably from the first centroid factor, and group 
factors do not necessarily correspond to the bipolars; any one 
group factor often combines parts of the variance of the first 
factor and of two or more bipolars. Finally, we have admitted 
(p. 25f) that the strict hierarchy of Fig. 1 is an over-simplification. 

(7) Psychological Soundness of Procedures Deriving from 
the Opposed Theories. Perhaps the most important objection 
to a ‘neo-faculty’ viewpoint is that it tends to encourage undesir- 
able practices in educational and vocational guidance. Although 
factorists themselves are well aware of the dangers of the ‘naming 
fallacy’, the users of their tests are not. If testers are told that a test 
is a good measure of the verbal factor, or the memory factor, etc., 
they jump to the conclusion only too easily that such a test will 
predict ability for any job, or type of education, which seems to 
them to involve verbal ability, or memory. The Division of 
Occupational Analysis’s General Aptitude Test Battery provides a 
flagrant instance. Personnel officers will naturally suppose that 
candidates for jobs apparently requiring hand-eye co-ordination 
should score well on the hand-eye co-ordination factor. But in 
actual fact the score for this factor is based purely on two tests of 
drawing lines and making dots accurately, and there is no evidence 
whatever as to the validity of these tests for any job. Previous 
research would suggest that it is extremely low. We do not 
question the great utility of batteries of differential aptitude tests 
such as this one, or Guilford and Zimmerman’s (1948) and others. 
But they should not be published without objective evidence of 
their correlations with job success; and if, as is usually the case, 
the factonal structure of the tests is complex and their loadings 
TOth practically important factors over and above g and v:ed or 
k:m is small, this should be made clear. 
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This point is much more obvious to the vocational or educational 
psychologist who accepts the hierarchical theory. Moreover, this 
theory considerably simplifies his or her task. Instead of having to 
apply a very lengthy battery such as the G.A.T.B. to all candidates 
for guidance, in order to determine their profiles on ten factors, he 
realizes that a short battery of v:ed and k:m tests, which can be 
given and scored in one hour, will take him a very long way.^ It 
will cover most of the ground in educational or vocational predic- 
tion that can ever be covered by tests. He can, however, proceed to 
give tests of further minor factors (perceptual, manual, etc.) or 
tests of a more work-sample type referring to particular jobs, in 
order to achieve a gain in accuracy of some 5 to 10 per cent., if this 
appears to be needed in any individual case. Again, the hier- 
archical viewpoint justifies the use, for many purposes, of the LQ. 
or of comparable measures of adult intelligence, that is of g or 
^ 4“ ^ tests ^alone.^ To the multiple factorist the LQ. is meaning- 
less compound, and Thurstone, Guilford and others strongly 
advocate the substitution of tests of half a dozen, or a dozen or 
more, factors. But in spite of their criticisms, the LQ. is found as 
useful and convenient by American practising psychologists as by 
British ones. Even if they do not realize it, such psychologists are 
thereby committed to a hierarchical rather than a multiple-factor 
viewpoint. And they are less likely to be led astray by their present 
procedures than they would be by ambitious schemes for diflcren- 
tial testing of all the main factors. 


^ Note that the combmed score on such a battery will provide a measure of 
Without the need of any tests of g alone Similarly it follows from the hierarchical 
theory that scores on tests of minor group factors will add up to give a measure 
of the major group factor from which they derive. 

* Similarly it j’ustifies the use of general reading attainment tests instead of 
separate tests for different reading factors which Davis and others* advocate. 
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Composition, 15, 41, 43, 52-4 
Comrey, A L., 37, 39, 139 
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Coombe, G., 1 

Coombs, C. H., 41-3, 122-3, 
130, 139 

correlation, implications of, 4-6, 
11, 13, 27n, 79 
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93 

Cox, J. W., 101, 110, 139 
Cox mechanical tests, 17, 112, 
117-8 
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Cube Counting, see Block Count- 
ing 

cultural effects, 32, 95, 106 
Cureton, E. E., 50, 139 
Curran, D , 57, 139 
Cylinder Construction test. 111 

D, see deductive reasoning 
Davidson, W. M., 54, 81-3, 139 
Davis, F. B , 21, 45-6, 55-6, 114, 
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Dearborn, W. F., 3, 33, 139 
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Dempster, J. J. B., 54-5, 68, 110- 
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Dewar, H., 93, 139 
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Dictation, 23, 39, 41, 100, 116-7 
difficulty of tests, 77, 79-80, 84 
Directions tests, 44-5, 62, 68, 84 
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Doppelt, J. E., 29, 139 
dotting tests, 84, 90-1, 96, 103 
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Dvorak, B. J., 21, 69, 140 
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30-2, 34-5, 42, 65 
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74, 77, ill, see also Reading, 
etc. 

ergograph, 97, 99 
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factor analysis techniques, 4-10, 
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fluency, 19, 33, 39, 41, 45, 50-3, 
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Galton, F., 11 
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Geography, 15, 40, 46, 73 
Geoige Washington Social In- 
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Gordon, IL, 64 
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Halstead, H., 60, 140 
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chomotor abilities 
Handwork, 16, 40, 47, 67 
Handwriting, 13, 15n, 16, 43, 
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Hargreaves, H L., 52, 83, 143 
Harman, H H., 20, 50, 54, 66, 
143 

Harrell, W., 102-3, 143 
Hams, C. W., 43, 143 
Hartshorne, H., 12, 143 
Healy performance tests, 107-10 
Heese, K. W., 78, 143 
heterogeneity, 16-7, 28-31, 34, 
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Hertzman, M., 79, 143 
hierarchical pattern of correla- 
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hierarchical structure of abilities, 
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Highmore, G., 104, 143 
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History, 8, 15, 39-40, 46, 73 
Holley, J. W., 93, 142 
Holzinger, K. J., 20, 37, 50-1, 
53-4, 58, 60, 62-3, 66-7, 83-4, 
129, 132n 
Hotelling, H., 24 
Howells, T. H., 91, 143 
Hsu, E H,, 58, 88, 144 
Hygiene ability, 40 

z, see imagination 
7, see inductive reasoning 
imagery, 17, 33, 88-90, 93-5 
imagination, 49, 51, 53-4 
inductive reasoning, 19-20, 41-2, 
45, 50, 54, 56, 108 
Ingham, J, G,, 60-1, 144 
inkblot tests, 53, 90, 94 
Instructions test, 23, 74 
integration ability, 22, 58, 62-3, 
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intelligence, 3-5, 7, 12-3, 38, 54, 
57-8, 77, 122 

intelligence tests, 3, 5-7, 13, 28, 
30, 34, 44-5, 53-4, 56-7, 59, 

71- 3, 79-84, 92-3, 96, 99, 106, 
no, 116-8, 135 

intelligence tests, non-verbal, 17, 
19, 32, 55, 65-73, 82, 108-11, 
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19, 30, 32, 39, 49, 64-5, 67, 
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interests, 9, 16, 19, 33, 35-6, 
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Jaensch, E., 90 
James, W., 12 
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Jones, F. N., 88, 144 
Jones, H. E., 104, 144 
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k, see spatial abilities 
Karim, J. E., 91-2, 144 
Kelley, T. L, 16, 24, 55, 60-1, 
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Kent-Shakow formboard, 100, 
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Kerr, G., 38-9, 68, 144 
k . w, see practical abilities 
Knight, A. R., 3n, 144 
Knox Cube test, 108-9 
Koerth Pursuit Rotor, 97 
Kohs, S. C., 109, 144 

L, see logic factor 
Lacey, J. I., 21, 27n, 41, 43, 56, 
61-2, 69,71,80,91,99,130,142 
Langsam, R. S., 45, 133, 144 
Language abilities, 1-2, 8, 15, 
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length estimation, 22, 75, 78, 94 
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